<*ATCCAC»CATCACCCCAACAATCGCAGACCCGATTACAG^^

TTCCCTAACAATgI^AAGGTATATATATGAATTTCAAGC 

GTGwIaCAGA^TCCACTGTTATATCCCTCTTTGTATTATTGTAT 
GTCTCCGTCT^AAGTAAAGATGTGAGGATTATGTCTCCTTT 

TTCCTG^TTTGTA^TGTGCCGTTTACAAAGTTAAAGTGGTTTCCTCTAGCAACACCGTTAATGG 
CTMATCCTTCTTGTC^AATTTCTTGTTAAGTTCTATGTTGGTTACAACTATAGATATA 

TATTTCTGCTAGTTTTGTTGAAGTACTGGATAGCCTTTACTGAGTTGAAAGTTTCACGGGAACAGCTTGAACAGTTC 

gaatctaatatctccctcagatctgttcaaaactttccacgactccattggg 
^tgaatatatgcagatgatgcatca^ 

mgagI^gttcctgtcgggggmgtggag^ 
ccagg^a^?cc^cctccacc?cm 



^tctataacct^ 

TATCTTCCACATCTCCCACCTCGTCTTCGTGTTTTTTCCTTTCT7CTTCTAGAAGC 
TACCTGAGAAGCTTCCTGTTTTACATCATTCACA^ 

gt"tgtt?a^cta^^ 

^TAAGGAAG^CGGTGTTGGGGAACCAGTTCCCCATTTTAAAGAAATCAAACAACT^ 

^G^GGAA^AAA TAT TC AGGG AAT GG AAC G AAAG T G G GG GAG AA A AAATTTT C G AC AT AT C T CAG AAT GAAGAAG AATGG AT GG AT AT CAT ATCC T T AG T 

ggaaaSg^tgaacctgtattttctaaatcacttaaacctgata^ 
aaagcggggjwvgaagaagatgaggaagaagaagaagaagaaaagaaacattccatatc^ 

CCTGGCACATTCAATATGGAGGATTTACCTTTCCGCCCTTTCGCAGTACCTTCAACTTATAAGACAGi^TTA^ 
ScCCC^ 

AAATAGTTCCATGATATCATTTGATTCTCAGACATTGAGTGGTGTGTATAAGAACACTGCrAATTACAAAAGGT^ 

GATTTG^CCGTTAGGAGTATGGCAGAATTTATCCGCACTGAAGCGCATAAGGCATTGACAGCAGAAGAGATGGAAGAAGA^ 

AAGAAGAAGCGATGGACCAGGAGCCTGCAGAAGTAGACTTCCTTTCAGTGCCTCATTTACGCCGTAAAAT^CGTCAAGC 

GAATTTGAAGX^TTATCTTCCCATTTATTCC^ 
GGTTGTGA^G^AGGAAAAGTGCTAAGTGTCACCTTCA^ 

ATTTAGTTGAACGAATAAGAGATGTGCTTCAGACTGCAAGTAAGACCGGAAACCTTAACAAAAA^ 
TGAAAATGGCATTGAGCGCCCCGGGGACGATTTTACAGAAAAGGGGA 

GCCAAGGCATAGTAAAAGAGACCGAAGGACTGACAGCTGCAACAGTGGC^ 

tcSg^a^t^ 

ACTTCTTTTTCGTCGTTTGCAAGGGACATGGGCGTCCAAGTTAGTGCCGATTTAGATGCTGAATTTGCTC 

CCCTTC^CAAAACTTGAAAGATC^^ 
TCCAGAAAGGGCTGG<^GAAATTAACGACACTG^ 
AACAAOTTGGAAAGGTTACAATTGTAGTAC^GGCTTCTGTGGTGGCCG^ 
ATCTCAACGTCGTGGACCACACCAGCCCTAAAGGTGTCATCAG™^ 

CAAGCACCCGTTCAGGGAAGAAATAGACCATGTTATCGCACTAGATGCATCATTCTTAACTGAAAATGGAGCATATGTATTC 
AAATCGAAATATAAGGCCTACGCACCAATCTGTGGAACAAAAGA^^ 

TGCCTTGGOTGGCAAGCATGAAAGATTTGCCTAAAGGACAATCCCTCTCCTGCGATAAAGGGATGT 
GAGGAGCTTCCTGGGCATGAAGATTTGCGCCCAGTTTTCTTCCCTGC^^ 

TC AACGACTTTT AT AC AAAAAT AATAAACACAGAC AT CAT AAAAATGGAT AGGT AAAAT AAC AC ATT AAATGTAT ATTTTATATATTTTT^ 
CAATAAAGATTATACATGTATAAAAATTGCATT^ 

TTTCCATCACCCATTCTGATTTTATTCCTATTCTTCTCCAAAACAGATTGCATTCCGTGTCCATCAACATCCGTGTTC 
ATGGCTCCGACATATCTTGTACTGTCGATTGGATAGTTTCAGCTAGGGAATGTTTTTCTGGTCCAAATTCTCCTCTCCC^ 
TTCAGCAGAA<*CAGCAGGCAAAAGTGGGCCCAGCCAGCCTCG^ 
ATAAAATCGGAATGGGTCAAGTGCATGTCCTTTAATTTACCC^^ 

AAGTGGAGTATTTTTCTTGTTTAGAAGCATGTCGTATATAGAGAGCGACAGCTAAGGCAGTTATTGCTAGAATAAAGAAAA 
ATTCATTCGTTTAGTAGTTA^ 

YGATAGGCACAGGCGGATATTCCGACGGTCCTCTTCCGAGAATATTTTGTCTAGATTGAAGCATTGCAGCA^ 
atcatcacgaggcaaaagattgtcctttgatccttttgataaagcttcgatgtttataaaggaacctaaaagtttccacattatagcaatgtgataaaag
ctgtccaacgaagcatatacagcagctgggaaagttgtccatattaattgcccactttgtaacgcagag<k:taaattttcaaccccgatcttgttgcttt 

GGCAGATGTTCGCCATCCAATCTAGTTTTGTTATCGTATCTAGAACTGGGGTATTCTTAGCCAGTTTGACTGCCTCCCCTTTCGGTTCTTTGGGGAAGTA 
AACTGGCTCCTTTTCAGTAGAGTTGGTTTTTATGTTTTTCTCTGATGC^^ 

TCACTCTTCACTAATrCTATGTCCCTTTCGATTAGTTGGCACACGTCTTTATCTAGCCTCATCCTGTTAGAGTTTCCTGTTTCGGCTGCTGGTGCTGTCG 
CAGACGCAGTTGCTATTAGTCCATCAGATGCGTCCAGTATGGAATTGGGAGGAGGAGCTGCAACAGATAACATGGCAGTAATATCATGGATGATTTTGGA 
GTAGTCCAGTTCTAATACATCAAAATCCTGTTCTTCCCCTTCCTCCATTTTGTTGGTGTCCATGTTTGAGGAGGGATGGTTGTTGATACTACCCGAGCTC 
ATGCCGTACACCGAATGACGAGAGGAGGTAGGAGCAGGCCCTTTTTATAGCCCTAAAGGGCCATTATCGACAGTACTATTATAGAAGTTATTTTTATCCC 
TGATATTATACAAGCAGAGAAAAATTTCAGTCTACTTGCAGGAGACATCTTTW 

TGTTTCTTGTCTGTTGGTTTTTTGAGTCTTTTGTTGTTTAGCTTATGTATTTGGGGTTGCCGATTACGGTGCCTCTTCTTCCTTTTATTCCTGTCAGCAA 
GTAAAATATCCTTTTCGTACTTGATAAATCTATCAACGTACTTTTTTATTAGCCTGTTTCTGGTCGTGGAATTCCGTGAAGAAGATCGTAGAAAGTTTGC 
ATCAGCAGATGTGGArTCTGTATCTGAATCTTCTTCCTCCTCCTCTCCAAGAAGAGAATCGACGTTAGTGTCTGATTCAGAGTCATATTCTTCTTCTTCT 
TCTTCTTCCTCATCGCTTGAAGGGATGGCCGAAGACGTGGGAGAAGAAGAGGAGGATGAGGAAGAAGATGTAGAAGTAGAAGCCATTTCTTTGTTTGTTG 
TTGTTACCTGCCCATGTTCCTCCTCGACATCATCTTCTTCTTCAATCTTATTCAGTGCACGGTTCAATTGCTGCTGATAATGCCTTGTGTTTGTTACAGT 
GCTAGACCATAAAGCCCTCCTGGAAGCAGCCGATGCTCCAGGGTCCATTTCTGATCGCGTACATCTAACCTTAGCAATAACCAGAAATACGTGCTTTTTT 
TTTACTTTCAGCCAGCCGTGTTAGTCATCTACGGAATGGTGTCTTTGTCTTTAGATGAGGAGTCGTACAGGCAAAATTTGGGTAAACCTTCAAACTCGCC 
TGTGCAGTCATCAAAGCATATTTGTATCAAAGAACCTTCCGC 

TCTTTAGTGAGTGTTAGTTTCTTTCTATTGCACAGTATTATGTCCACATCGTGCCCGATTGTGTTATTTTTATCATTGATGACAGAAATGTCGTAAGCAC 
TTTCTCCTTCTTCTTCCCCCCACTCCCTTTCTGCTTGTTCCTCCTCCTCGTCTTCAAATTCTTCGTCGTCGTCTTCTTCTTCTTCCTCCCCGCCAAACCT 
CTCCCTCGTTTCTTCCTCTTCGTCATTTATTATATGTCTCCTAAAATTCTTGTCCAATAATTTTGTAATGAAAACACACTCTGGCTTCTTCCTGACCTTT 
AAGAAACTTGCATCCATCATGCTGTTCATTCTCATATCACAAAGGCCATCATACCTATCATCTCTGGCCATGATCGCTTTGACGCTCAAGAATATATCGT 
ACATAATCTTAGACTCTTTTTTAGGTGAGAAAAGTTGGTTTTGGAACTGTCTAAAATTAATGAAACCACGGGACAATTTCTTAGCCAAACCCAGAAGATG 
AGAACCTCCGTACCCTTCTCTGTGGGAATGTGCTAGTGCAGCATTATGTATAGTAAAAAGCAAACCAGGATTTTCTTTAGCCAAACTCTGTATAACATTA 
GTTGCCGTCACTACAGGTATGAGATTTGTCATATCTAATTCAGGTACATCTAAACATTCACTGATAGGAATCACTCCCTCTTTCTCTCCTTGACCAACAA 
TTCTCACCGTCACTAGTTGATGTGCATCTTTTATGGCTCTATCATAATCATGGGGGAAGTTTGATAGAGCTAACGTTGTAGCATTTTCTTCTTGCAAAGG 
TAAACACCCTTCGTCTTCATCATTGTCGCGAACATCTTCATCAAAGTCCATGCCGTTATTGTTATTATATGCCATTATATTAGAGAAAACATCCATCTTT 
CTATTCCTAGGACCAGAAGGGAGCATTAGAGCAGTTAATGAAGGTGCATCACTTTCTATTTTATTTGTCTTGGAAGTAACTTCGTTATTTTTCGTATTCA 
AAGTGTCTATCAGGGCAGATGCTGCATC^CGCACTCGCTTCTCGCGTGGGAGAAGATTGTCACAATCAGATTTAAAGTGGACTAAATGCAGACACAAATT 
AGAAAAAACGAATTGAGATGCGGATTTAACAArGGACAATATTTGATTATGAGGAAGTACAGACGTGAAAGAAAGAGCATGCCCAAGATCGTCTGATGAT 
GTGATTTTATTTTGGGCACCTATAACTACTAGCGCTATGTGCATCGCGAGGACAATTACTTCAGTGAAGATACCATTTTCTAACGTCAAACCTTCTACGA 
ATGCAACCCCTCCTCTATTTGTCCTCATCATCATCTTTGAAAAATGTTCCATCAAGTGTACCTTTGTCATACTGGCTGTGACAATCTGTACGATATTCCC 
CATAGTCTTTAGGACTAACCCTTGATCGATAAATTGTTCCCTCATAACTGCGGCAGCAATATCTATGGAATTATTACTGTATCTATCGGCATATGTTCCT 
GCCTTTGAATCTTTGATCCTCTTATCCAAACTATCAAAAAATCTTCTCTCTGCGGGCGAGTCTGTAGAGAATATGAGGAACTGAGGGACTGAAAGTTGAC 
CGTTTCTGTGAGTCCTACCGATAGTCTGTAGGAAAGCAATGGCGTTATAAGGTACATCTAAAACGCAATGGTAACGTTTAGCGTACATGGAGTTATTAGA 
AGAGTCATGTAAAGAGAGACCGGTATTGCCTTTAGGTCCCAGCATTATAACGTCGACTTCTTTCGTGTTGTTGAATGCACTAATACATTTGTTTGTGTTG 
GCAGTTTTAGTGTTGTTTTTAACCAGAAACAATCCCCTATTCGTTATACGCGAACACAATTTCCTGTTTGTTATTTCAGCGTTACTATCCTCCCCTATTG 
ATTGTGTGATTGAATCTATAGGGTTGCCTGCTAGCGCCATGAGCGGTTCCGTGTCAGGGACTGAAGGCACAAGCATAACGTGAGCTGGGGAAGTTCTTCT 
ATTGGTTTTATTCTTTAAGGAGATTGTGAAAGCTGAAGCCATCATGACTGCACGGCACAAAAGTCTATAGCCTGAGAATATTGTATTAGCAACAGGCGAT 
GAATCGAATGTCCCCACGTCTACGATGCCGTATTTTGCGTCACTTTCATCCTTGGCGATCGTTTGTAAGATGCGTGTAGTCAAATTCTTGAGAAAAGAAT 
CCCCAGTTTGCTCTAGTGACATGACCATCTTTTTTGATTCGTTTGTTGTCCTTAAACGCTTCACTGCATCGGTGACAGAAGTTGCCTTGATATTCAATAG 
TGCACCTTTACATGCTGCCACAGAAGTACTTGCTGTATTAATGAAGTATTGTCTCCTCAACTTTTCAAAACACTCGTCGTCAAATACTTCACCCCATCCG 
TCTTCTTCATTAGCACCGTCGTCATAATCTTCGTTAAAATCAGTTTTACC 

CCATATCAATGCTGCTTATGTCAACTTCCGGAGACGTTATTATTCCTCCTCCTCCATTAGAAAGCGCATCCTTTTTCTGTGAGTAGGTTTTTATAGCATC 
TTGTACTATACTTCTTAGTGTTGTTTTCCCATCATGTGCGACATCCCTATCGCGCACAACCACCACCTTIAAAACGCCTGTCGATATTTTCCATTACTTCA 
TATAAATCATCATCATTTGCAGCACGTTTCCTCTTTCTGGACGGGGAATTAATCTCTGCTACTAATTTTTCAACATCTTCCTCTTCCAATATTCCTCCTT 
CTTGGTGTTCTCGTATAATCTTTGTGAAGGCCCTTCTGACATGTCCTCCTACTTTTTCAGAATCGATGAGTACTTGTCTAGCGTTTAAACAGTACGAAGC 
CAATTCGTCTATAGCGTATTTTTGAAGAGGGGAGGCTTTACAATTAGTAGTAGAGCAGTCGACTCCAGCCATGCTAATACTCCTACTAACCAATTGTCCT 
CTATTTCTTAAAAGTTTAATAACCTGTTCCAGAAACACTGTCACGTACTCGCTGTGCTCAGCCATAGCATCTGGAGTAGCACTGGACACTTCTTTGAAGG 
CTTGTATGCTCGTGTAGGCTGGCGCACTTTTCCrTAGTATGTGGTCAATCATGTGCAAATCAGCATTGCTCTGGAATGGCGTAGCGCTGGCCATAACCGT 
GAACGTATCATACTTTAAGATCTCTAAGAAAATACGGAACGCATCAGCTTGAGATAATAGTTTCAAGTAGTTGGACATTGTAACTCGGCCCTTATTATTG 
GATGATTTCATGCGCTTCACTTTAAAGTTTCTGTCGTCACTCATTGCACTCTTGAACCTATTAATAAGGTCTGACACTGAAGGGTTTGCCCTTTTTTGTA 
TGTTGCGCAAAACTCTAGTATCTTCTTCCTCCCAAGTCTTTGCCAGTTCCTTTCTAAAAGAGTCACTAATATTTTGAGTCTTGTGGAACTCGTCACATAA 
TAATGCTGTGACAAAGTTGTCAATGGGCATAACCTrGTTGGAATCGAGGTCTGTTCCTCCTGTTATAAATTTCAGTACAAATTCCAGATTTGTGCGTAAA 
TCAGAGTACGTCATAAATAAAATGGTGGGCGTTTCTGTTAAGAAATCTCCTATTGATCTTGAAGAACGCTTACTATGAAATTCCTTAACATCCCTGAGAG 
TGAAAAATCTAATATCCACCGAGTCTTTAG<^GACGATACAGAATTTTCCACATCTACTGTCATGGACCCACTTTTCCCGTCGCTCTTGAAAGACGTTGG 
TTTATTAGCACAAGGCACTTGTAGTTTTCTCCATGAAAATTTAGGATCACGAAGACCTCTAGAATTTGTAACCACTTCTCGCATGCCCTGTTGACAGCTG 
TTGAAAAGAGGCTTAGAGCACGTCAACCATATAAAAAAGGGGTGCCTTTTCCACACGCCACTGTTGATGGCCGTAATTACCTTGTCCGAATCTTGACCAA 
AAATTGATGGCCCTACGTGTTTTTGTACATCTAAGAGTGCCTTTTCTTGTAGTATGACACTCATGACGAAGGCTGCCAATTCTCTTGTTTTTCCTACACC 
AGTACCATCTCCGATGATAAATCCTCCACTCGTTTTCTTGTCC^ 

TTCCACCCTCTCCCTTTGTAGAATTGAATCAGGGCTAGACGCACTGCTAGTGCCTGCACAGAATTAATGTCCATCGGCCGAGGAAGAAATTTGGGGTAGG 
CATCGGTTACATTTTTGCAGAAGCCCACCGACCGGAATGTTTCACTCTCATAAAGAGCGAATGAAGGTTCTGGGTGGGAGAGGGTGCCAATCTCTTTAGC 
ATCTTCTTCCTCCAGAAATAAAAGTTTGTGAGCCATGGTAATTCAACCAGAAAATCAACCAATATATGTTGTAACAAGAAGTTGCCTTAAGAAGACTCTT 
GTGTACTTTTTTTACGATACTTGAAACGAGTACGCCTGCACGGATCAAAAATCT 

TTCCCATGCTAAAGAGAACCAGGAGTTACGTACAGCCATACTAGTCTCTACAAGTAATCGTCTATAAGCCTCAGAAGTATCCACACCGTCCCTGAACCAC 
GAAGTTTCTTGCACGTCCGCCTTCTTATGCCCCCACTCGACGAGTGACTTCTTGAATAGAGAGAGGGGTGTTATTGTAGCTCCTTCGGGGGTTGAGGAAA 
TGTTTGACGCATCAAACGCTAGACGAGGGTCTGACTGTACTACCGGGCCACTCTTGGGGTTATGATCCAAACGAAACGATTGCAAACCCCACGCATCGTC 
GTCAATATCCACTCTGTACTCGGATGAGGAAGATGATGTTCCGAACAATTTTTCACTTCCGGGAGTTACATTGGAGCAATCAAAATAGCTGGGGATTGTT 
TCGCTGTCAAATATCCTATTGTGGTCTACTGAAAAAATATCGTCGGTAGCTCCTTCTTCGGGAAGAACAGAGTAGGATGTGGGGAAAGCGTACTGCAAAG 
GATTGGACACTTTCAATGTACTGGAAGTTTTTGATGCAGGTTTTCCTTTTAGGTAATCrAAATCAAACATATCTCGCGCTAGCACAGTAGCAGCATAAGG 
TCCGTATGAACGCATTAATAACCCCTCTTTCAATACAAAAGTATCCTCAGATTTTTTTATGATATGTCTTAGAAATTCTTTCCTCTTCATTAAACTATCC 
CAAGAACTTGATTTGAGATATATTTCACTTGTCCTCATCCCTCTCGCGTGTTTCTTTATTACGTATTCTGTCTGAGACAATCTCCTCTCATCGTTAAAAT 
TACGTTCAAAAAAGTCCCTTATTTCTTCCAGCCTTTGGTCACCTGTAGAATTGTACTCTGGAGCTGGAGTACCTCCAGGGGGAGTTCTGATATAAGAGGA 
GGGTTTATTTTGACTCAAAAGTGTAGAGAATAAAAGTTCAGATCCGCCGTGAACACTACCCGAAACCATGCGACTATTTCGCAAACATTCTGTGGCCAAC 
ATACTATGTTCATAGGGTTTTTGAGGTATGTTTCCGGCCAAAACATTAGC^ 

AGGAAGGTTTGTTCAGGACGGGAGCTAAACAACTTCTATGTAACAATGAAGACGGGCGGATGATTACATGGGGAATATTTTCTAGAGGAGGTAAAACAGT 
GCTAGGTATTGGTACAAAGAGCACATTCTTAGAAGTATTTGTTGCATCCAGGTTTAAAGGCGTGATGATGGGCGCTTGGGCCAGTGCAATTTTCTTGGCA 
GTAGTTTCGTCGATACTTGATTTAGGTCTTTGCTGGTGAATCTGGTCCCCTCTTTTAGATTGGTCGGCCAAGAAAATTCCCATGGCAGGAACAGTCCCCA 
CTAACGGTGTCCTGTCATCGCCTATTATTGCATGCCTGTTTGAGTCTTGGGGATTATTAACGCCCGGTACACACTTCATTGTGCTCACATCAAACGTAAC 
AAACGAAGGGTCTATGTTGGGCATCATAACACAGGGGTTTTCCATACATTCATACATACCTCTGAATTTCATATCCTTTGTTACATCTTGAGAAAGAAGG

TCGTTATTTAGTACATCCTTCTTATTATTACAGAAACAGGTGATGGGGTTTTTGTATAAAAATTCAGGATCGTTGCATTTAATTTTGAATTCCGCCAAGG 

AAG AAG ATTG AC GC AAT TT GGAGATGT AAG AAG AGTTGCGGAATTCTGT ATG AT TCC GT ATTT GT TGTTTCT TT AAT AC ATC AT AAT AGATTGGTGTCG A 

AGATGAAGGAGGTGGTGGTCCTTTGTGGGTTGAAGGATTGAAAGCGCAAGCAGTCATCGATGTTCCTCCCGAACCTCCGAAATATGTTGGGTATTTAGGT 

CTGCATTGCCAGCCCCCATCATCTGTCAGAACGACTATAGATGTGTGCACAGAACACTCTTTAACTACACGTTCTCCAGTGGTTGGGGATGTTTCTATGT 

GGTCACCGGGCACGTTTCCTGACCAAC^iATAAGGTTTAGGATCTTCTCCAGATTTATGTGTGATAAATGTTCCTCCAGCAACAAATGGTGTCATCACAC 

AACTGCTCTCGAGGTATAACACACAGATGTGCAAGTTTGTCCCTCTTTCAAAGGAACTGGGTTGGAAC^GTACCCGCCAGTATTGTAGGCCAAAATATCC 

ACATCTTCATCATGTCCTTCTGATGTAGATGGAGGAGGAGGAGGAGCGTCACCACTTACGCCTGATGCACTATGGTGATGGATATCATGTACACCAAATA 

AAGAATCGTACAATTCAGTATCGTCTTTTTCTATATTATCCTTCAAATTGACCATCCTTTGTCTAAACTTTTTCCCCGATACAGCATCTAGATATTTGTT 

AGATTTTTCGATGGTGGCCTCACTCAATCTATCTATATGGTGTTCAGGCACGAGAAGAGATGAGGATCTATTTGCAGCTAAATCATGACTTTCGAACCGA 

GAATTCCTTTCCTTCGTTTCTGACGGAACGTTCCATTTATTCCCTTTAACAGTCAAATTCATCATGTCCAGGAGAGCGGGGTGAGGTGGGTAATTGCTTG 

CATCTTCCACCCCGTCATGCCTACCTCTG^TTCTAGGTATTCTTCTAGTGTCTTTTCAGGGTTAGCTCCTTGAACTGCTTTCAATTTACTTTTCATCCC 

TAATTCGTGTATAGCTTGAAAATTAACAGACATTGGTTTGGCTACATTATAGGCCATTGCAGCGCAGACTAGCGCAAAAATACAGCCTAACACGAACACG 

GTCACGAATAACGTTATTTGCGCGGTGAAGCGCATACTTCGGTTGGTAGGGTAAAGTAACGGATCTCTTCGTCGTTCATCCATCGTTCCTGATATTAC^ 

TGGATATCTTCCTAGATAAATCTCATACAGGTGCCACCCTTTTAGGGTCTTATAAGACGCAGACTGTGAAAAATAATTCTACGTTAAAAAAGCTTAAAAT 

GGGCGCGCCTACCAACGCTGATTTTACACGCACGGTATCAGGAGTAGCTTCTTCTCTCTATCTTGTTAACCCTGGAGCTCCTTCCGATAGAGAAAAGTTA 

GTTTTGGCCTCTTCTTATTCTGACTCTTTTGTGTACAACTACAAGGATGCAGTGGrGACCGCTGAGGCTCCCAAGTGGTGTCCCTTTAACGAGCCAGCTC 

TTCATGAGCACATCATGAACAGACTTGAAAAAGCTGGTCTAATTAACAGATCTCGTTTTGTGTGTAACCCTGTTAAATCGGCTGGAGAAGTATGCGGATT 

TCGCTATTCTGGAGGAAGTACTCCCCAGAACTTAATTTTCCCGATTGGAGCATCAGAGATAAATTGGTTGGTCATACTTAGAAATGCTGCTCGTTTTGGT 

ACAGTGGCAGCATCGGCCAAAGACGCCATCGAACGCATTCCTGATCTAAGAGAAGGTGGTACAAGTAAACATGTGGCAAAGAATGCAATGAGGAGACTTC 

GTGTATGGCGAGCCTTTAACTGGATAGCGGAAGCCTCCAGATCAGCGGGCATGATTCGCTACGAACCCTTTTCGGTTGGCTGTGCTCTATACGATCACGA 

GGTAAGAAGAAGGCAACTAAAAGGAAGTTATGAGCGAAATGTACTATTTCTTGGTGACCATTTCTGTAAAGAATCAACTCTTCTGGCTGATATTTC 

GGGGGAAGAAGTTCTGATTTCTGGACGATCGTTGAAGCTGTCATCCGGTACAAGAATAGACATGCTCGAACAATCAGTAATGAAACTAATGCC^ 

AAGACTCTTCTATAAACTTGGAGTGGGAGGATGTTCTAGTAAAAAACCAACGCGACACAAATCAAG^ 

TGCCATCAAAGAACATGAAAGCATAGGAGAAAAAAGGAAAAAACATATCCTAGAGTTTATTAAAACATGTCTAACAGAGGAG 

AAGGGAGTTGGAGG AAAAGGGAATTTGTCCCC GGCCC ATC T AAC AAATCTGGCC G AT GCGAT AC TGGCT AAT AAT GCC AAAG C AGGC AT ATGGGT T ATT A 
TGCAGAGTCTTTTAAAGCAAATCAATTTCTCTATACTCCATTTAATAGGGTACGAAGCTCAACGACTTTTAATGTTCAAGTTGTATATGCCTGCATTACT 
TGCTCTTTTTATATCTCAGAGGGGAATTGGTGACGTGTTTCTGAATGGTGTGTTTAACCTAGAAGTC 

GACATGGTTTCTCGAGACGCCTACAAGAACAATACTAACGAGTCTAATAATTTGGGCAATTATTCCCAATTCGATATCGCTACAATTTATGGTCAAATGT 

CTGATTGTAATGCAATTCCTCTATCTATTAATATTGGAGTACCTCTCCATAAATCGAGGATGAACATGCAAGACATTGAAAAGACCATACAACACATAGT 

GGATATGGGTTTGGCAAAGATACGTGTAGCGGAAGAAAATAGAGTAGCTAGACGTCTTTTACATAGAAGAAAAATGCAAGAAAAGGCAGCCAGGGAGCGA 

GCAGCTGCGAGACAGAGAAGACTAAGAG<^GAAGAAGAAGAAGAAGAAGAGGAGGAGGAGGAAGAGGGAGAAGAAATGGAAGAAGAGGAAGAAGAAGCTG 

GAACATCAGGTGTTAATGGTTCTGGTTATGATCAAGAAGAAGAAGATGATGGTGAGGAGGAGGAGGAGGAAGAGGAAGAAGAGGATGAAGAAGATAGTGA 

GGGTGAAGATATGAATGGGGAAAATAACTCAAGAAAACGTAAAMCACTGGTAATACTTCCT^ 

AATGCGCCCATTTCAACCAAGGGAAAAAGGTTAAAGGAAAGAGATAATATTGGAGGATTTTTA 

TGGAGAGCATACAAAAACTTTTGACGGCCAGGCAAAGAAAATATGTTAAAGGTGGAAAATGTGGAGATAATGGACTACCAGAATTGTTAATTGAAAAGGT 
TACAAATCTACTAGATTCAGTGTTCCATTTCAGAAAGGGGTCTATTCTTAAC^ 

GCAAACTGTATTTGTGATTACTATGAAAGiWiCGTATCAAAAGACAGTAGTAATAATACACCCCATTCATCTGAATGTATTGAGCGGGCAAGAGAAAGGG 

ATGCTTCCTGTGCCGAATCTAACAAACGCCCTTGTCCTGTAGACTCTAACAACCCTGAAGATGTGGAACAACGTATGCGGGAATTGATCATGGACCCTCC 

TTCATTATCAGGGGTAGAAGATTCTCTCGCTATCGAAAGAGTACTTCAAAATGAGATATTATTCACAAGTCTTGTCACCAATCCAATTTTCAATGCCGTG 

TTAGGTGCTGAAAAAGGAGATCTTGGAAGATTTATCGTATTAAATAACATTGTAAAATTTATGAATATGACTATTGCTTGTCTAGTGGACGGAGATATGC 

CTATGCTTCTAGACTCAAGAGGCAAGAO\AAAAACCTTCTAGAAAAGGGTACAGTGAAAAACACGAGAAAATTTTTCAAACCAAATATGACTGC 

ATTGAACGTGGCCACTGCTCAATCTGCAGGACACCAATACATGAACGCTGGTCATTGTCCAGAACCTGGAATCAAGCAAATGTTACTCCCTGATTGTATA 

ATGAAACTGAAATCCATCGCCATGGAAAAGGGTCGAGGAGGGAGATCGGCCCTTCACAGACAGAAATGTGACCATGCCTTTTGCAAGATGCTGAAGTGTT 

TATTCTTCAATATTGACCCTTCAAATGCTGCAGATACATTTATTGACCCTGCGTCACGTGCCACCTTATTTAGGCTTGATGATCTTTGTAGGGATAGGAA 

AAAATACAAGAACATTGACTGGGTAAAGGATTTACTAGACCCTGTAATGAAGGGAACAAATAAATGGGTGGGAACTGGAGAATACACTAACATTGGACGT 

GACTCAAACGTGGCCGCCCCTGTTGATTTTTACACAATTTTGAAATACACAATGATTGATGATGGTGTAATAAGTGTTCCTTCACGGAAACCAAATGACG 

TGTATTACAGTACTATTGAAAGAGCGGATGACTTACTCACCGAGAGCAGGGACGCTTCATGTGAAAGCTACCGCCCGACCTTGTTTGACGCCAGAGCGGT 

GTTGGAAGTCAACGGAGACGGACGTGTCCCTTACCCTTCCAGTGAAACCGTAGAAGACTTGGGAGGAGAAGAAGAAGAAGAGGAAATAACAGGCGTTATT 

GACGACAGTACTGAAATAGAAGACGTTCAAGTACAGGACAGTAATCTATTCGATGTGGA^ 

GAGAAGAAGAGGAACTTCCTTCAGCAATAlXrrGAAGTGTTTGCTTCATTACCAGCTGATAACGATTCCTCCTCACCCGCGCATATTCCCTCATTTGGGA 
TTCTGAGGAGGGGGAAAAAAGTCCAGAACCGTACAATATTTTTGATTCTGCCCTCGACCAATTATTGGATTTAATAGATAGCGATGGAAGAAACAATAAC 
CCTAAAAGAGTCGACTGGAACAGTGTCACCATTCAAGAATGAAAATATTACCAGAATAGCGTCTAATTACGTCCGAGCCTTTACTGACACGTGGTCTCAT 
TT GGT C A AC ATTTCTGG AGCTCCTCT AACTGCTG AGAAG AAT CC AAGCGCT ATCC C AGC T AACG AAC TG A AT AGAT ACT GG ACC AAAACT AACGTGTT AT 
GCAACCCACTCTTTAAATTGGAGGACCACATAACGAGGGATGAAGATACTGGTACAATAACACTAAAATTCAA^ 

ATATCAGTCTGCCGTTTTAATGCTGGCTCTCGATTCGTTCGTTTCGCTTGCATCTTTTTCCCATGGAGCTGATTTGGTTTCCAATAAAAGTGAAAACAAA 

TTCTGCGTAAAAATTCCCCACGACACTCGCGCTGAATCTTTACTGAATAATGTTGGATTCCCCGCTGGATTAAGTGGGCCTTTTAAGAGATGGAGCATTA 

ACTACAAGGCTGCAAACTTGAGCGGTAAAAGCGGTATAGATGGCCTTTCAGGTTCCATGTTGACAGTACTAAAAAATAACACCAATAAAAGAGCAACTGA 

TATTTTACATTTGGTGAATAATGTTTCTGCCTCTGCACAACAACTTGACGATTCTGAAATGTCTCGTACTTTTAACCACCAAAAGAAAGTTGGAGTTTGT 

TATGATATAAATGTGTCCAGTTCAAGGCAAGTGAACCAGCGTAATTTACTCCACCATCAGAATATAATAGGACAACATCTGATCGAATTTAGAACCAAGC 

AACTTGAACGCGCACAAAATAAAAAAGTCAAGGAAGAGGAAAATGGTGAGCATGAAGAAATGACAAGTGAGGAGGAAGAAGAGGAAGATGAATATGAAGA 

AGGAGGTTGCCTATCAGATATTGACGAGGAAGATTTCTATGAGGATGGTTACGATGAAGAAGAGGGCGATGACAATAGAACTAGAAAGAAGAAGAAAATG 

GAAGAAGATGAAGAGGATGAAGAAGAAGAATATGACGATGAAGAAGATGAAGAAGAGGCAGAAACTTGTGGTGCTAATGGTGTTATTGATTGTGAAGACG 

ATGCAATCATTTTCCCCAATGGACAAAATTCAAAAAGGAAGAAAAATGGTAAAAAAACAAACATTAAAAAGCGGTCACGGAGGAAGGGGGAGTGCTCTGC 

TAACACTTTATCCTTTGTGGAAAAATACGTTGGAAATTGTAAGAGCCTAGGTATAAAGCCAGTAGGGTGTCCGCCCCCATCCACTGAATTTACTTCCCTA 

TTTATGAAGGGGAGTGAAGCTGACAGCTGTTATAATACTTGTCAGTCCACCAGAG^GGCTAGCCGTATAAGGTCACTACTCAATAAATACTCTGTTAAAG 

ATTTGATGCAGGTAAACAGCCCTTCGAGTTGGAAATGGGCTAACCCTCCCGACCGACGGTTCGTGCTGTTTGATAAGAAAACTAAGGAAGAAGTTGAGGT 

TAAGTTTGAAATTGAATGTGAAAAATCCGAGTATTTTGATGTCGTATCTGAACTCCCTAGTAATATTAAAGTATGGTTAAAAGAGACGGCAAAAATAATA 

AAACATTTGGCTCTGATTGAAGACTTTCTTCCAGCTATGGGTGCTGCTACCCCAAAAATTCCCCTCAATTTGATTAAAACTATGACGAGCATTTTCTCTG 

TTAGAGATATTGTTGGATTTAAAATACC^GAAGAAGTGCTCAGTTTTATTCCTATAGAATGGAAGACATCTATTTCTGCAATGGGGCTCCTCTCTOT 

ATTTGATCGTATAATAGAAGTGATAGATTTAATGATAACTAATGGCGCCTTTGCGACGTCATGCTTGAACAACGCATTCTTCTTGGAAAGAGGAGTGGTG 

CCCAGAGATGGGAGTAACACGTGGCTCCACACGGACCTTGTGCAACTCTCCACCTCCATATTTAGAAGTATTCGCAACAGAGGAGTGAATATTGGCGGTA 

ACAACAACACTGGTAGCAATTCTTCTAGTTCTTTCTTGTGGAGGGAATAAGGGCGATTATGGAGTACGTTGTGGATTGAGTATAAGCAAGCGTGGTATAAC 

CCTGAAACCACCACCTGCAGCGATGACTAATTCTTCTTCCCCGTCATCCTCGGCCATGATCTCATTGCCTCAGCCCACGCGCCAGAGCATAGATCTTTCG 

ATAACGACAATCATCCAAGATTTCTCAGAAGTTTCTGGGAAATTGAGGCTTAATGGATTACAGAAAAACATGTCTGACAAGAGCAAAGACGTGTTTAATG 

ATGCAATATACGACTCTGGCGCATTCAAGGCGCTCCTAACATGCACAGTCAACGATAAAAGTAGACGTAAAAGGAAAAGAAGGACTTTATTGGCATCTGG 

AGAGGGTGT GGT ACG AAG AAAC C TGATGGTGAGTC AGGGC AATG ACGTC AATGATGCCC ACC AGTTCC AGG AAG AATGCGG AAT AAAAAT TGGGGGCGG G 

GCTTCTAGGGTGTATAAAAGAGCCCAGCGCCGAGGTTCGGCAGTCAGTTCCAGAAGAAGAGTAAGGAACAAACCCCAGTTTACTATAGCAGTCTCTGACG 
AAGACGACGACrGCGAAGAAGAAGGCGACTTTTCTTCCGAGTTAAATCCAACGCACTCTCAACTACTTCTATTCCAACAACGGCAACAAGATAGCTGCAC
AGAAGACGACGACGTTTTAGTGTCTGTAGAAGAATATAACAACAGAGTAAGCGGTTCTTCCACCACAGCCGGAGACAGAGTTCTTGCAAAGGATCTTCTC 
TCTACTGTATCTCCGAACGAAAAGAGGAACTCTGCCGCCCTCGCCGCACTCACCATATCCCGGCACTCTCTTTTCAACGCTCTATCTGCAAAAACAAAGT 
TGGGAGAAAATGGACGTTTCTTCCTATAAGAGCACTATTGACTACCACAACATTGAAGATATGGACGATCTCCAGCGCGCCACCTACAAGGATCGTATGG 
AGACGGAATTGGTCCTCGAGATGGCTAAGAAGGAGGGAAGGTACGTCCGATCGTTGGCCACCATGGACGAATTGGAGGTACCTGAAGAACCAGCCACTTG 
CTACACTTGCGGCTACACCTTTATTAGACGCAGGGCACCCCCACCAAAACGCAAGTCAATATTCAGAGAGCCTTGCGCTTACCCAGAACTTCTCCCCGAT 
GCACCATCCCCCGTCCGTTTAGAAGAGCTTGTCGACGTGCCAGAAGGAGCGAGTTTTTTCACCTACCCTCCCTACGACGACGGATCTTCTACATCGTCTT 
CACAAGCCGAATGTGAAGATGATTATCCTCCACCATACGACCCATCAGAAAATCCACAGAGGTCCCAAGTGTGTGATTATTGTACCACACGTCAAGTCCT 
CAGT TCT ATG AC GGATC AC GC C AGGGC C AACCTC AT AAAAAATC T GAAG AGGGAG AAG AAGGCC CTGGGTCTTGGCC GTC GC AAC AACTTT AGC TACT AG 
GGTTGTACAAGAAGAAAATAGGATTGATTGTATCGATGACGAACTTTGCATGTTTTTGACCATTCTTTTGCTTGCTGAGTGTACATAGCTTTATATTTGC 
CAATAAAACAGACCCCGAATGTATTTTTGTGTATTCTTTATTACTGCCTATTTTTATGGTTTAACGGAAA^ 

GAAGGGAAAAAGAGGGCGCTAGTTCATATACTGCCCAATAGGTGGTCGGGTCCAGAAACGTCTGTCTGTTCCAGAAATGTGTTTAGAGATTTCTGGACAA 
GTCATTTCTGGAAAGGGTTGTAATTTATTATAGTTGGTATATTATTTCTGGTATACATTTCTAGCTCCTCTCTGTAATCTGACGTTGGGTCGACCCAGCG 
GTCCACCCTTCGAACTTGACATAAG<3CCTAGTCAGCGGTCCACCCCCTAAACTGGAGTGAGCTGAAAAAATTTTTGAAAAGTTTTTGAGATGGAGATGAA 
GGGTAAAAAGAGGGTTGAAAGGATAGCTGAGGCCACCTGGGTCAGCCGTGTTCCAGAAACGTCTGTCTGTTCCAGAAATGTGTTTAGAGATTTCTGGACA 
AGTCATTTCTGGAAAGGGTTGCAATTTATTAAAGTTGGTATATTATTTCTGGTATATATTTCTAGCCCCTCTCTGTAATCTGACGTTGGGTCGACCCAGC 
GGTCCACCCTTCGAACTTGACATAAGGCCTAGTCAGCGGTCCACCCCCTAAACTGGAGTGAGCTGAAAAATTTTTGAAAAGTTTTTGAGATGGAGATGAA 
GGGTAAAAGGCTAGTAATAGAAGGTTGTGTAACTCGGCATAACCCAGAAACATATATCAAAAGGGATGGAAACAACTATTCTTTTTATACAGTAGTTTTA 
TTGTACCAAAAACTCAGAAATCTCATCACATGACAAATCTGAATGTCCCATTCCGTGGCCCTCAATAACGTACGACACTGGACCTGGGAACTCCCACCTA 
CATGTTTTACATACAGAAAATTTTGGCTTAGATGCATTATCAACAAACACTTTTACCACATTGGCATTCATTCTCTTCAGTTTGTCACACATCCACAAAG 
AGGCCATATTCTCCATAGAATCAGATCCATCAACAAATATATAGAATGTTACACCCTTGTCGATCCAGTCCATGGCGACAATAGTAAAGTCTTGTCCAAA 
AACACTGTTTCCGAAGATTTCTTTGTCCCCATCCATGTATTGGGAAAAAGTACAGTATGGTCCACAGTCCTTGATCTCCTCCAACCAGTCGTTCAAAAAA 
TCCCTAATAAAGAAAGACAGGCAATTCGACTTGATCTGGAAACCAATGTOGAGTTTCTTCGTATTTTCTTCATTGTTGGAGGGGAAGAAATACGCCAATT 
GTGTATCTGCTTCAAATGCTCCACACTTCCCTTTATGGATCAGGCGGTTCTCCATAATCTCCTTGCCAGAATTGGGG 

ATTGGGGCGGGTAGGAACTTTGCCAGTCAAGTCCATGTTGGCAAAGAGATCAGCAGGGTCTTCAAAGACTGAGGCCATTTTTTTGGTTG^TTGCTGCTGAG 

AGAGGAGCAAGTAGTAGTGTGGTTACTAGCAAGGAGCTCACAGTCTTATATATAGACTAACACCCCCCCCCCCAT^AAATTCTCCCACATCTTTCATACCG 

GGGTCGGCTAGAACCCAGAAACGGTTTGACTCACATCTGGTACATTATTTCTGTTATACATTTCTAGCCCCTTCGTGTAATCTGACATTGGGTCGACCCA 

GCGGTCCACCCTTCGAACTTGACATTCGATCGACTCAGCGGTCCATCCCCTA^^ 

GAAGGGGGAAAAGGGTTAGTGAATACAGGCAGTTCTGCCTTGCTCCAGAAATGATTCTAAAGATT^ 

TTCGTGTAATCTGACATTGGGCCGACCCAGCGGTCCACCCTCGGAACTTGACATTCGATCGACTCAGCGGTCCACCCCCTAAACTGGAGTGACCCAGAAA 

AATTTTTGAAAAGTTTTTGAGATGAAGATGAAGGGGGAAAAAGAGGGTGCTAGGGTGCGCTCCAGACATCTCTAGAGTGGCTAGAGCCCAGAAACATCCC 

AACTAGTTTCTGGGACATTTTTTCTCCCACTGACGCAGACTATATAAGCTAACCACTAAGCATATTTTGCACACATTTCATCCACCATCACTGCCGAATA 

CACTCTTGTGTTGCTCCGCATAGCCTAACAATGGCCCCAAACTCCTTCCAGAAATTTGCTCCAGTTATCAAGACTGAGAAGAAGGAAGAAGAAAGGGATG 

AACATGACGACCCTTTACGGCAGATTGATTTTAGAGATAGAAAGACATTAATCTGCCTCACTGCAAACTGTGTTTCGAGGAAGAGAAAAGCTGGATCTGC 

ACATGATCGAGTATACAAAGTACTACGCTACGGGAACCCATACAAGTACCGTCGCCCCAATAGAACACATCGAGGATTGGCCCTCTCAATGGATCAAGGT 

GAAGTAGGAACATGCCTCCCTCTGCGACCCATGGAAGAGACTGAAGAAAACCCCATCGACAAGTGCGGAGTGGCGTrCCTGTACTCCAACTACAATGAAG 

GCGATGGCATGACCCACCTTTACAACGACGAAGAGTATATAAAGAAGTGCAAAACAATTGAAGGAGGAACAAGAACGTGGGTAAAGAA 

ATACTTCAGACAAGCTCTAGAGACATTGATGATGTCCCATTCTATAAAACAATATTCCAATTTTATTTTTTTCAAGGAGGATATGGAGGAAGGATTTGTG 

CAClAAACTCCACACATTTATTAATATGGTACACCCTAAAAAGGTGTCTGTT/rTGTGATATAATAAAGATTGATGTTTTTATAAATGAAACGTATTATTTA 

TACCAAAAAATTTATTATGATGTATTTCATTGGGATGGCTTTTCACTCACCAATCTGGCCTATGTGTGTGTTCATGCTAGCCCCTCTTTCACCCTTCATC 

TCCATCTCAAAAATTTTTAGAAAATTTTTCTGGGTCGCTCGAGTTTAGAGGGTGGACCGCTGGGTCGGCCTAATGTCAAGTTCCGAGGGTGGACCGCTGG 

GTCGACCCAATGTCAGATTACAGAGAGGGTCTAGAAATATGTACCAGAAATAGTGTACCAGTTATAAGAAAATGTGACCCTTTCCAGAAATGGCTTGTCC 

AGAAATCTTTGAACCCATTTCTGGAACAGACGTTTCTGGAGCATGGTCGACCCAGATATCTGTTGGCGGCTGTCAATAGTAGCCTATGTATTCAATGGTC 

TT TT ACCCTC TTCTC C ATC TC AAAA ATTTT TAGAAAATTTTTCTGGGTCGCTCG AGTTT AGAGGGTGG ACCGCTGGCTC GGC CT AAT GTC AAGTTC CGAG 

GGTGGACCGCTGGGTCGACCCAATGTCAGATTACAGAGAGGGTCTAGAAATATGTACCAGAAATAGTGTACCAGTTATAAGAAAATGTGACCCTTTCCAG 

AAATGGCTTGTCCAGAAATCTTTGAACCCATTTCTGGAACAGACGTTTCTGGAGCATGGTCGGCCCAGGCAGCCTCACAGTTACCTGTTCACACCCTTCC 

TACTAGCACCCTCTTTCACCCTTCATCTCCATCTCAAAAATTTTTAAAAAATTTrTCTGGCTC^CTCGAGTTTAGAGGGTGGACCGCTGGCTCGGCCTAA 

TGTCAAGTTCGAAGGGTGGGCCGCTGGGTAGGGCCC^TGTCAGATTACAGAGAGGGTCTAGAAATGTATACCAGAAATAGTGTCCGATTATTAAAAATTT 

AACTTTTCAACATGTGGCATACACCGTATGAACGTATAGCCAAATATGTACCACAGTCAAAACATTTAGGTGAGAAAGATAATACACATGTATAAATTTC 

AAATACTTTATTTGCATACATAATAAAAAATCATACAGTTTCGGACTCATCTGTCCACACATCATTTATTAGGAACTGGTATCTATTATACCCTATTCTT 

CT ACGTT T AC AAGAT AC TT GTT C AT C AGGC AC TGCCATT CTT TT ACAAGACACT ATT GGT TC AACC AAC ATC TCTTT AACTTG ATC AGC GTCAGGTC TT A 

GGGAAGTACGTTCTCTGCATACAGGACACCGGGTAATTTCTCCACGTTGCGTACTCAACACCAATCTGTCCCAACATTTTTCATCAATACAATGTCCACA 

GCGTGTCGTCCTGATGATATTGGTACTACTGAAATCTTCTAAACACACTGGACAATTATCGCTGTCGAAACGCACATTAGCATTCTTCTCATGTTCTTCT 

AGAGCCCTTTTAATTTCTTCCTTATCCCCAGAAATTATGTGTTCAGCCCAAGGATAATTTGAGGATGCAATATACCCCAAGCGACAATTCAGTATTGGCC 

AAGAACGAGGTGCTTT ATTG AC ATC GAGAC C GGTGGC AGC AAC AAGGGGTTCTTCTTCTTGG AG AG AGAAAT CT AAAGC AGCTTT C AAT ATGGTGGGT AT 

ACTAGGCTCGTATCCTCTCTGGGCAAACTGTGTTAATGCGGTATCATTGAACATCAATAATGCTGCTTCAGTTACACTTAAAGTACTAGATTTACCATAC 

AACATATTAATCTTTGCCCCATGGGCTCCAACTAGTGCATCAGAGACATCGTGTTTGTGATTGGCGATGGCAATCATTAGGGGCGTTGCACCATCGCTCG 

CCTTCTAGCATTCACTATACTTCTATCGAAATCGTATATTATATTCTTGTATTTTTTAGCAAGAGCATCAAGAAAATTGCACGAGGTTGCAACAGCTGCA 

GCATGGAATACGTTTTCATTATTTTCATTAATAAAGGAAAAGGCOIAGCTGTTTGTTGTTAGAATTTCAATACCTACTCTTGTGGTAAAGGGGGAAGATA 

TTGCAGCGCAAACCATTAGAGGTGACATGCCTTTCGCAGAAGGCTTCAGAATGTTAAGTTTTTCTTGTTTGACTAGTTTGCTAGTATGGGAATATTTGrT 

AATGATACAATAACACAATAGAGTATTCCCCTTTTCATCAAAATAAGTGTGTTTGAGCCACGGATTCTTGGTAAGAAAAGCTTCAAGATCTTCC^TGAAA 

CACGAGACTTTTACACACAATCTAACATAGCCTTTAACTTCATGAGGGTCTGTTATCACAACGTTTTTATGTGATACGATCCTTTTCACAATAAGATTAT 

CGCACAGTTTGTCGGTCTTTTTTCTTACCATAATCTCCCTCAAACAGATCTTGAAGTAATTGTAATCAATCTCTTCAAAGGGCCTACTGTTAATCATCCA 

C AAC AGT AT TGC ACC AC AC AAACC ACT AC C CAC TGCTTCGTTT ATTGC AACATTTTT T AG AAT AT TGTC C ATC ATTTCGGC AGAG AATTT TGGC ATCGTG 

TCAAATATACTGTAAAGGGTTTGAAAATGCTGGATTGTTTCAGCCTTCTCTATAAACACATCAACAAACTCTACTGCCATTTCAGCAACAGTTTTTGTAG 

TAGTCTCACAGTACTCTTCTACCATTTCTTCTTCTATCGAGGAAAGAAAAGACTTGATAATCACACTCCTGTAAGCACTAGCCCTTTCTTCGTTGGTCCA 

ATTCAATCTAGCTATAGATCTAGCGTCCATGTTCATTTCTTCTCTGAAGTCAGTTGACACGGATGATGGTGATGTTTCTAGGCAAGAAAAAGGTCTCCCG 

ATAATAAAATTGCCATTGGATATCAGTCGTTTTGCCTTTGTAACACAAGGAGATTCGTCCACAAAATACTTGTATCCGAAAGATATGTCAAAAGGTTCAA 

GTGGTGCAGATTTTTTCATTTCAGCCACGTAATCAGAGGTGATATTGACGATTCTTGAAAAGAGCCTGAATCTAATAACACTCGAACATTTTTCAACGTA 

GAAAACAATACCACTTCTTGCAGAACTAGTAGACTTTTTCAGGCTAGCCAAAACACCGTCCAACTTCTTGATCCTTCTCATAACCTTCTCTCTTTCTTCC 

TCAGCCTGTTCCTTTGAAGTAAACTTGAATCCAGTTCTGCTGTCATCACCAGTGCCAAACTTGATGCCGTGCGTCTCGCGTCTCAAAAATCCATTATCCA 

TAGAGACCAGAAGAGAATATTTTACGAACAAAAAGTCGTCGTGGATGTTTTCGTAAAGGCCTCTGAAGGTTTTGCAGACGGTTGTCAATGCGTTGATAAA 

AGTC ATTCC C TCGC AGATGGGGG AAG AATC AG ACTTGGT ATTGTTGTTG AT AAAGAAGT AG AT AAT ATCT CT AAACTCT TC T TT ATT GTC T AAT T TCTTG 

AAACTACTTGAAGGAACAGGAGGAGAATTTCTGGAGGTAATTATGTCATTCAGAAGGGCCAATTCCCTTTCTGTGAAAACGTCCAGAAATGACATATATG 

GTTCAATGTTTTCAAGTACTTCTTCAAGCACCTGACGGTATCGTGGAGCTGCTTCAGCCATGTTGATGATGTCTCACATACGACTGTTGAGTTTATCCAT 

GCGT ACGCC CGCTTTT AT AC AAAGATC CC GTGT AAGAAACTCCC TCC GG T TC AGTTC AGG AT AGGGGTGT GTC C C AGTTTT AC ATCC AAAGT T AAT AT AT 

TTTTTTAATATAACAAAAAAATCGTACCGCTTATTGGCTGCTATAAAAGAGGGAGCACCTGCTCACTTGGACATCATTAACCATCATCAATATGGAGGGA 

GAACATCAATATTTGAACCTAGTCAGGGAGATCCTAGAAAGAGGAGTGAAGAAGGACGATAGAACTGGAACAGGAACTCTATCCATTTTTGGACCCCAAA 
TGAGGTTCTCTCTTCGAGACGACACTATTCCAGTTCTCACTACCAAGAAAATTTTCTGGAGAGGAGTTGTGGAAGAACTCTTGTGGTTCATCAGGGGCAA
TACAGACGCCAAAGAATTGGCCAAGAAGAAGATACACATCTGGAACGCTAATGGGTCGCGGGAATTTTTGGACAGTAGAGGGTTATACGATAGAGCAGAG 
GGAGATTTGGGACCCGTATACGGATTCCAATGGCGTCATTTTGGTGCTGAATATGATACCTGTTCTTCCGATTATACTGGAAAGGGTATTGATCAATTGG 
CCAATATACTAAAGACCCTGAGAGAAAATCCAGATGATAGAAGGATGATTATGACGGCATGGAATCCTATGGATCTTCACCTTATGGCTCTTCCTCCATG 
CCACATGACTGCTCAATTTTATGTGGCTAATGGAGAATTGTCGTGCCAGTTGTATCAGCGAAGCGGAGATGTCGGGTTGGGCGTGCCCTTCAATATTGCA 
TCATACTCTCTTCTGACTCATCTGATGGCCAGTATGGTGGGTCTAAAACCGGGAGAGTTTATCCTCACTCTTGGTGACGCACACATTTATAATACCCACA 
TTGAGGTGTTAAAGAAGCAGTTGTGCCGCGTCCCTAGACCATTCCCTAAGTTGAGGATTTTAATGGCTCCAGAAAAAATTGAGGACTTTACTATCGACAT 
GTTTTATCTTGAGG5GTATCAACCACACAGTGGAAACTTGCAGATGAAAATGGCTGTTTGAATCATGTTAAGGAATTTCCTTGTTACTCATTTATTCCTA 
GAAATGGTGTAATCGCTGTTGTGGGCGGAGCATATTTGTGTATATAAGAGCCCGrGTTAGCTCCTCGATTCAGTCACAAGAGCGCACACACACGCTTATA 
ACTAGCTCTCTCTCrcCACTCAAGATGGCCTTTAATTTTGAAGACTCTACAAATCTCTTTGCCAATATGGACTTGACGGCTGGCACAACAACAGACCCTA 
CCCGCCCCAATATCATATTCTTTGAAAGTCTACTCCCCAACTCTGGTATTGAGGrGATGAAGAGGCGTCTCGTACGGCAAGGAAAGTGTGGGAATTTTGA 
AGCAAGTGGAGGTGCTATGTCGTATTTCTGGCTCGAAGATAATGCAGAAGATATGGAGAATCTCAACAGTGGTTCCCATGTCAAGACAAACTGCTTGGCA 
TTATTCCTTCAAGAGTTTATCAGCAACTGGATTGAAGAGACTGATCGACATGGACAGTACTGTACTTTTCCCCAATACATGGACGGTGGGGATGGTTCAC 
GTGGGGGATATTTTACTTCGCTAGCCATGAAATGGATGGCTAGGGATGTGACTrTCTTrGTGTTTGTTGATAGGAArAATACTGTAGAAAATGCGGCATC 
CATATGGATGTACCAAAAACTACTAGCAATTGGTGCAAAGGTAGTAAAGGTGATTGTTGACAATGCATCAAACCCAATGTTTTCTGTATGTAATGCGTGT 
AGGTGCAAGTACCCAGGCCCAGTGTCATACGTTATTGAAGGCCATGGAGTGGGTCATTCTGATTTGACATGTGATGAGATTTCTGGATTCTTTGTATAAT 
AAAACCCCATAAGAAACAATAATCTTTTTTATTOU\CACCCATGATTTTAGTTTT^ 

CGATAGCTCCTTCACCTATATAGAGGAGGTGGATAAAAACAGAAGATATGGATTCTAATACTTCTATTTTACCGCCAAGCAAACGGCCAGGGTTAAArCT 
GTTACAGGTTTTAGGGATTATAATAACGGTAGCATTAATAGCTTCCGTTTCATCCTTTATATTTTATAGGGTAGGTAAACGCAAATATTACCCTTCTTCA 
TCCTCTTCCTCAGAATTATCTGATGTAGATAATGGGGTAGAAGGAGGAGGAGGAACAACAACGACACCAACTCAACCTTCACCTGACGGTGGAGATGGAT 
ACGTAGATCTTTCTCCTCAAAAGAAGGCTGAACTAAGAACTAGAGTTGCAAACGTCATCTT^ 

GGCAATGAATGATTCAACTGATAAGATAATGGAAGAAACTGAGGCGAGAATCAATAACTTTTCAGAGCCATTCAGAGAAGCAACCGTAGAACGTGAAGTG 
TTTAAGGATGACACAGACAAAAACTTTATCCTTTCAACTCTAGATTTA 

AAAATTTTGACTATGAAGACATGACCCGTCTCATCTTTGATAACATCCCAGAGACTGATTATTTATGGACAACTCATTTCGATCCGAAAAAATATGACAC 

GTACTCTGAAAAGGTATTAGGGTTCTCAGATATAAATAGTATAGAAAGAATATCCTCTACATTTTATAAAGGTAAAAAATATGAGGTAACTACTGGAAAT 

GTAGCTGTCCTCGTTGATTTTGAATCTGAAACAATAAAAGAGAAGGCAGGAAATAGTCTCATCCGTAATGTCGAGTTrATTGTTGTGGACGAACAGACCT 

ACAAATCTTTCTTCCCTGCATTCAATCAAGTTTTCTTCTCCTTTAAAGTAAATAAGGAGAAAAGGGAAGTTACTGTArCCATCAATAATGGATGTGTAGG 

TATAGTGGCCAATATTACTCCTCTAACTACGCCAGTTGGAGCAGCTTCCGGACACTACATCTATGGCACTAGCACAGCAAAGGAAAAGACCTATCTATTT 

GTAATAGACAAGTACGATACCACTGAATTTGTTTGTGGTCTGAGTAACAAGTCAACTCCTCTCATGGCTCTAAATATrCTCTTTATGAGTGATACTGTTT 

TCCCTTCATTTGACGAAGCAGAAAGACCTCTGACGGATGCCAAGGCAGTAGAAArTTTAGGTAAAAGACTAGGTGTAGGAAGATACACAAACGCCAACA^ 

CAGAAATACTCACTGAGATGGAAGGGGTTATTTTGGATAAGATAGAAACAATTGCAAAAAGGGCCTCTCCTAGTTATGGGTCTATTGATGTGGGTACGGC 

TATTTTGCGCCGTCAATTCATGGAAAAAATTAGGGGTAAAATAAATGAAGAAACCACAATGGAGAAGATTATGGGCACAAAGGAAGAAAGAGAGGACACT 

ATAAGAAGTATAGTGGCTAATGTTATCAAAGAGAATACTGTTAAAGAAAATGTAACCGAAAAAATTAGAGCAATGACAGATAAGGAATTAAATGACAATA 

GGGAATTTATGCATGATTTTGGAAAAATTTCAACTGGAGATGGAGGAACCTTCCATCTCTTTGAAGATACACCGGGTTTTGAAAGTGCTTTAAAGGCAGA 

ATATAAAAACGTTCCAGGAGCAACTACTCCAAAATACGTATCTATGAACAGTTTACGTATCGATGCGATTAATGGAAAAATCGAAGAGGTTTATAATCCT 

TCACCTATCATG<MTATTAGAGAATACGGCACCATTCGCAGGGGCAGGTACGAAGAAAATGCAGGTTCGAAAGAATTGGTTTTTATGACCAAGATTGA 

AAAGACCCAATAATGTAGCTGAAAATCTCATTAT^AGAGTTGCCAACCAGCAGTATAATGTTATGAGGATGGTGTTTTTTATAGACTACGAAACAAAGAA 

GGGGGTGTCCAAGGAGGAAATGTTTATACCATATAATGTTCAGAAAACAAAGGCTCTTAAAGGGCGTAGTACTTACTTTTCATTCGTAAGGAAAATTCCT 

GATGAACCAGAAGGGAGTATCATAATACACGCACTAGGGTTTTATTGAGGAAATAATAATAATAATAATAATAATGGCATTACAGGAAAAGGATATAACT 

ATAGGGAATGTTTCTGCTGCCCTACGAGAGTTGATGTACTCACCCACACATATGCAGCATCACGATAAGCTAAACACATTCCTGGACAGAAATGTTGAAT 

CATCTTCAGAAGAGAAAATAAGACAAATTGTGGATAAAATACGATCCCAAACAACATCTGACATATCTGAAACAGTCAATAA'PGTCACAACTAATGGGAC 

TGCATTTTCCCTTTTCGAAGATACCTTAGAAGGTATGGTGAAAAAAAATATAGGTGArAACCTTCAGAGTGGGGACTTTATTGATGGCCGTAAAAAGCTC 

AATGACATGAAGAGTCTAGCTACTGGAGCCATCTTATCTAGACAGCGAGATTT 

TGGGTTGTGGTATTATAAGGTATACTGTATTTGTCAATAACCTTGCAAGATCAACACrCGATAATGATGATGACAAGGCAGCAACCTATTATAACACCCC 

TATATATGGCGGGTATTGTAAAATGGCTATAAAGGACTATGAAATACCAGATTCGTACAGCAAGGTCGAAGCGGAACATACAGTTGAAGGAAGAAAGATG 

ACCTTTAATATAAAATGGAGAGGCGATACCATAAACAACCTAATAACAATCATCCCTTCAGTGACAGGTTATCTTGCTTCCATCTCTGAAGACGCAGATG 

TGCAGGCGCCATTACTTTTAAACTGCAACAACTGTTTTATAGAGGCAGATATGAGTAGCCTCTACATGGATGAGAAAAAAACAGAGGCATCATTTACCCT 

CAACTTACCGGAAATCGAAGGAGCTGATGCGAATGCAGTCTATGAAATATGTATAGTAGTAGTTTGATGGAGGACAAATAAAAAATTGGATACATTTCAT 

GTAGTTTTATTTAACTCTTAATCTTATATACATTTCATGTACATTTCATTAAAAATCACTATCAGAGCTATAGTACAGAAACTGGTGCTGGAGCTGATTA 

ArCCTCCTGGCCATGACGGCAGATGTCCTCCGTGAAGTGTTGCGTCTGCCGTCTCTGACCCTATCAACCATCCTACCTGAACCTGGGTTCAAATTAACAT 

TTGGTGCATCAAAGGGGGTGTGAAGAGGTGAAGAATAAGGAGAAGGTACAGGACTAGCATCATAGTCCCTCACAATCCTTGTTCTTCCCCTTCTAACAGG 

TGTGGATGCGGGAGGGGGCATATTAGCCGTGTCTAGCAACTCCAACATCTGTGCCTGGTATTCAAGTGCGTTAGACACAAACCTCTCAGTGTCTTCTGCT 

TCATTGTCATTATTAGGAATAACACCATAATCTTCCATGAGGGCGTCACGCCTTCCTGCCTGATGCCTCTCAAAGAGCCGTTTTCTTTCCTCAGGGTCAT 

CAGAGAAGAGTTTGTGAACCCAGCATGGTTTCTTAGCCTCCCCTACAGCCAAGTCAGGCACAAAAACTGGCTTATTTTTTACATTCTTATAGGGAAGAAG 

AGGGGGCAACACCTTGACAGGAGCAGGAGTGGGGGGTAAAGGAGTGGTATATTCACCCTTATATGAACTAGGTTTATGGAACTCCTGGCAAGGTAl^ 

CCCTCCCTGTCCTCTTTTAGAGGGGCGGCCTTACAGCCTCCAAAAACCTTCTTAGCTATTTTAGTATGCATTTTCTCAATCCTTTTCATACCTTCAGACA 

ACATCATGAACCCATTCTTATCAAACTGAAAACCCCTCAAGAAAAACTTATAAAAGGACTTTTCATTCTTTGCAAGAAGACATCCCGCCATTAGCCTAAT 

TGCACTCAAAACATCCCCTATCTCCCCTTCCCCATACTTTATTTCAGGAAACCTATrGAGTGCTGAGCAGGTACTTTCTATAGATGCTCTCAGAGCATCA 

TTTTCCAAGGTTGATGAGCTATAATCAACCTTCTTACAGCTGGCACACTTCTTAACCATATCCTTTATTTCCATTCTTTTCTTATGTTTCTTCATCATTT 

CTGATGAAGAGGAAGTTTTGATTGATGTCTTAAAGACGATATrCTTCCCTTTTATTGTCTTTCCTTTCTTGCTGACGATACACCCTCCCTTCTTGCTGCT 

GCTGCTGCCGCTGCTGCTGCTGCTGATGATGTCAGTTCCGGCCATTTTTAATGACTCTAAAAAACAAGGGGGTAACACATGTAATAAAATGCATTATAAT 

GTAACATTCTTTTATTTTAATAAAAGGCAGTAGAAAATTTGTACATCATTTACAG<^AGGCATATACAAAAACACATCACTTTATAGATTGGGCAGGGAA 

GATATTTGATCAGCTTCCGAAGGGTCAGTATCAAACCCATTCAACAATGAACTGTGTACATGTGCAATTTCGTCATCCAACCTAGCATTATTAGAGGCTG 

ACTTAATGAGACAGGAACCAATTAGCTGCTTATTAGCACTCCACTTTGAAGCGTACTTACGCTTCACCTTCAATTTCTTAGAGCACAAACTCTTACCAAT 

TACATCATTGACAGTTGTATGGAACAGGTTACCTCTAGATATACAATCTCCAAGCTTATTCTCAGTTTCCTGTCTCCTTTCTCTAAGAGCAGAAAGTTCA 

GCCAGATACCTATTGGCAATCTCATTATACTCCTCATCAATATTATCGAGTCTACACATATACCTCTTGACATCTACACACAGTCTCTTACTCTCAGAGG 

AAATATTAGACATCCTGTCCTCGTCACGCCCATACCTTGGCCTCTTTCTACTGTTACTACTACTACTACTACTTCCTGAGGAAGTAGAGGGCTGATTCAC

TGATGACGAAGAAGGTATTACTTCTTCGTCATCAGAGTCTGAATCAACATTAAAAC^AAGGTTAACATTAATGTTAAGAGAGGTTGAGGTAGGGGCGGAT 

GAGGTAGGGGCGGATGACGTAGAAGCACTAGGCATATTCAAGTCAGGCAAAGACATTGTGTCATCATC^TCATCATCATCATTAAC^TCATAAAGTGAAC 

TACTAATACCACTAATAGTATCAAAGGTAATATTGTTACTTCTAGGGGGAACAGTCAAGTCAGGAAGTGACTCTGAGTCATCATTATCATCTACCTCTGG 

GGGGACAAACACTACAGTATCCTCATCATCTTCAATGAAATTGAAGGATGTCACACCAGACAACAAACTAGGGGAGACATTATTATCACCTTCATCATCA 

TCATCATCATTACTCAAGCATTGAGTAATGTCAATAGTATTGGGAGCTACATACTTAAGCCATTCACCTTCATATTCATAAGGGGGGATATCCTTGAATr 

TACCCATTAAAGCTTGCCTCATGATTATCTTCAAAGGTCTCTTCACTGAATGC^ 

ATAAGCCCTCTCCATATAAACATCCTCATTCTCATTAACACCTTTGGCATTGGGGACAAAGAAAGTAGGTGCTAGAATAGTTGACATCTTATTAAGGGTA 
TCCTTTACACCTTCATCATTAACATAGTCAGCAGACAACAAAAATGAGTGAAAAGAGGCATACTTGTACACCTCTTCATACACTTTACGGAAGAACCACA 
CAGGGGGTATATAGGCCCTAC^CATGGGACATAACAGGTGATTTTCATTCACCATATGCTCTTCAAATCTCCAGGTGAAAAGACACTCCAGGTGGACACT 
TTTCCTGCAACACGGCATCCCTCCATTTGACCAGTTATAGCAATCCGTGTCCGAATCGTAATCACCCATACAGATGGGACAGGTAGGCACCTCATCAACG 
GCGACGGTTTCAGCCATCTTGTTGTTGTTGTTGTTTACTTTAAAGAGGGAGAAAAGCCAACGCTATAGCAACTCATTTCTTTACTTTAAACAGGTTCATT 
ACAGTCCCCTTAAAAGGGAATATGTGCTTCATGGCAAGGGCATATTCTGTGATTGAAGGGCGATTTTGATAGAAATCAGTGTCTGTTCCGCTTTGCAACA
ACATCAAAAAGTGCATGTTTGACATCAAATTAAAGGCAGGGCCTTCCATGTTAGAACAAGCAACAACATC 

ACTGTACGTCATTGAATAGGCACCTAGACAGTGTCCCCTAAAGAACTCGGGAGGTGTGTGAATGTACTGCCTGATTAGTTCTTCCGTGTTCCTAAAATAT 

TCATCGTTGTAACAATAACCTAGAGGGTATGAGAGTCCCAAGTCAATCATTTTAGGTTGGCCAGATATGCTATCTATAACAAAGTTATCAGCCTTTATAT 

CGACATTAATAATTCCCTCATTTGCAACTCTTGTAATTACATTACAAGTCTCAGCCAAAACAAAAGGCATGTTCACCTTTAGTTTTTCAAACATACTT 

TACAACCTTTCGGGCAATGAAAGAGTCTGTTATTGGTGCGACTGTATCAACCCCCTTTACCTCTGGTAAACGTCCCATCCTTTGAAGCATGACAACCCCA 

TACAACAAGCGTGCGTCCCTTGACTCATAATACTCCCCACCATCTAAACTATCCTTGACACACCTAATGACATCATCAATACTGATACCTGCAAATGGCA 

TCTCTAAACATAACCCCTGAGGACACACACCAGTCACCCCAACTACCCCCTTAATATTCTTTCTGTATACAGAAAGGCCATTAACACACTCAAAAACAAA 

CTCTTCATAGTTCATATCCTTCATAAACTTGACAACAGTCCCCCTATTCACAATGTAATAGACCCCATATTTAATCTCAAAAAAGGTCTCCCAGGGACAG 

ATGAAAGTGTTAAAGrCCTTCACATCTTCAGGCACTTCGACCAAATCAGACAAACAACCACGAGCAGTCTTACGGAAGTTTTCCCAGTTCTCACACTGGT 

CAGAAACTTCCOTAATTAGCGTGCTAGTCTTTTGTCTAATTCGCTCACAGGCACTTGAAGGCAATTGGAACACCTCTACAGCCATATACATTAT 

AATCTGGGAGGACGGAATATTACGACTAGACAACTCCTTAAGAATGTGTCTAGTGTCAAAAAATCTCATCTC 

AAGCCAACTTTAGGGTCCTTTCCATAGAAACAGGTATCATGGACATACTTGCTAGACTTAACAATTGACCTATGGCCAATTCTCAACTGTTGACAGGTAT 
CATAATTGTCCTCAAGACTACAACTCTTCTCAATGCCTTTAAAGTAGTAAGGGGAATCACGTTCAATCCACTCATrATACTGCCTC 

CACCTTGGACGCACACACGTTTGAAAAGGTGCGAGTAGTGGGAGGAGAGGGAGCAACTTCAGACTTCTTGAAACGrTTAGGGGGAGGGGGCGTGATGCTA 
AGGTTGATGACGTCATCCCAAGTCTTGATAGTGATGTTGTTGTTGTTGTTGTTGTAAGCGGGAACAGTTGTGGCTrCGGAGGTTGAACCAGACGCGTAGT 
TGTTGTTGTTATATTCTTCCACAGGCCGTTTTTTATTCCCCTGATGGTAAACATACTGCTGGTGGTGGTGGTCTCCACCAGTATTGATGGTAGTAGTAAT 
TACAGTGGGTCCCCCCATTGCCTATGGAAGAAACAACAACAAATACACGATTAATTCAACGCCCGGCACACTAATAAAGTGATGTGTTTTTTATATAAAA 
AATAGCCCATAATATTTAACGTTACGTTAATCAAAAAAAACAACAACAATAATAGTATATTATTGOT 

AGTAAATCCAGAAAACAACAGTTTTACTGTTGGGCACTACACTGCACTGGTGACGTAGTAGTAGAAGTAGTAGTGGTAATAGTCCACTATTGTCACTGCA 
CTGCACTAGTAGTAGTAGTAGTAGTAGTAAGAGCGTTGGATAAGCTCGGGTTGGCTGCTAGAGCTGGAGCTGGAACTGGAGCTTGTATGCTTGGCTGTGC 
CTCTGCTGAGACTGATGCTTCTGGCCTGCCCCGGCTCGCTrATATACAAGTTGTCCCCCCTCACCCCCACTTCCAGAAATTTGCCGTCGAACGCCAGTTC 
TCCAACAGAGTGGGGTCCAGATCTGGCTACGGGTTCACTCCTGGCCATACCCTGTTTAGGGGTCTAAATCCACCCCCGGCACTAAATGGGGAGGACCTAA 
GGGTGTTATAACAACATGTAAGCGTTGGGTAAGAAGATCTGGATCTGGATGACCCACCTTGTCCTTCTTATCCTATCCTTATCCTTGTCCCCTGTCTACC
ACCACCTCACCCCATATCTCTCACCCCATCTCACCTATACCCCCATCTCACCCATCACCTCTATATTCCCCCACCTTATTCACTCGCTCCAGTTTCAACA 
CCCTGTTCTTGCCGAGCCAACCCATAACCAGATCTGGACCCCAGTCTTCCCTTTTATCCCTAACCGGCACCATTTATGCCCCCAGGCGCTAGCGGTGTAT 
ATAAGGCGGCGCGGCCAGGCCAGAAGCATCAGTTCTCTGCAAGCCAGCAGAAGAGCAACACAACAAGCACTCTCTCTCCTTCTACCTAGAAGAGACCTGC 
CAATACTCAAGCTACAAGAATGGCCTCTCCAGCCCCCGCCGCACCAAGTCCTTACACCATGTTGGACTCTAAGTTACTTAGTTCTGAGGAACTAAAGGAA 
CTAACTTCATACGTCTCGACTAGCTCTCGCCGGTCTGATATGAAGAAACACTTGCTCCATCTATTCGAGGAGCACGAGAAGATCTTCCAATTCATACAAG 
GTAAGCACAAGTTCTCACTATAGACTTTGGACTTTGAAATTTTCTATGTTATGCTGAATATT^ 

TTTACTCTTTGACAGAAATCTCCAACCAGTACGGAGACTATGGATGTTTCACAATGGCCCCGCCTCACCTGAACGCTGCAGCCGATCTCTTGGATAAGGT 

GATGTCCGGACCTCTATCTCCCGAAGGCGCCCAAACTTCGTCCCCGGCTGCTTGTGTTGGTGCGAAGGTTGTGAAGGCACTGGTGAGCTTTTGCCAGAAG 

ACCCGCTTCACCACCAACATTGTGATGAGAGAAGTTAAAGCCATGGAGTTCCAAGGAGACGATTTTAACTACTCTGCCTTGTGTGCAAGTATGCCCCAAC 

GCCCCGTGACTGAGAGGCAGATGTTCGCCCTTATGAAGAGTGAGGACGAAGAAATGGGAGTGTCTGCAAACTTCTCTCCAGTCTCTGATGACGTCATCAA 

CCCTTCAAGCCTCCCCTCTGGACAAGAAGTCGACTCATCAACTTCCGCTCAAATTTCTGGTATGTTTCAAAACGTGTGGAGTTTGCTTGAAGAGTGTGGT 

AGTGGCTCTAATAGTAATAGTTCCCCTGTCTCTAGGACAGTCTTAGTTTGCACCCTGTTTATAATCCAAGTGTTCAAGTTTTTGGTGACTAAAGTGTCTA 

ATGTGAACGTACTTAACCAGTTGTTTGGACATGTTGTTTTTGGATCACTTGATGTGGCTCCAAGTAATAATAATAGTGTCCCATCAACTGTTGTTAACAA 

CAACAACAAACCCTCGACCTCTAATAATAGTAACAACATCAGTAACAAGCGTGTTGGTGGTAGTAATAACAGTGGC^GCGGAAGATCAAAGAAAGTTACA 

GCCACAGCCAAAAATCCCTTTAATAATGTAGATGGGGACAATCATGGCATGTTTGCCGGTGC^ 

AAGTTGAAACTCTTACAAGTAAGAGCACCATCCCTAAAGAAGAGGTAAATGTAGATGAAGATTT^ 

AGAAATTCATACCTTTAATGTGTTCATCTCTGAGATTAACCCCTCCAAATATGACCGTTCAATGTTTTGCAAGGGATTTTTGACTGCATGGGATAAGTTT 
GTAGAGGGGGATACTGCTGGCGTTAAACGCTTCCGTAACTATATCCTCACTCGCTC^ 

AGGGGACTGTTTATTATAATGACAAGTCAAAGTTTCTGTTCCACGATAATGTTAACCCTGATCTGGACAAGAGCTGGGGTAACAAGAATGGGAAGAAACC 
TAGACTCCCAGCTAACTTGATGGCATTCATGGGTATTGACATTGTAAAGGTGTGCGCTAAGGGGATTCAAAAGTATATGTTTGCAAAGCAATTCCAACAT 
CCGGAAGTGGAAGAACTTGTGCCTCCTATGGCTGTATACGCAAAGGTTGCCGCAGGATTGAAGTCGGGGACTTTGTTTGATGACTGGGACCTGCCTGAAT 
ACGAAAATTGTCAGTTTATCAAGTATGACACAGAAGGGTGCAAAAAGCACAGTGAGTTATACGCCA^ 

TAAACTGGAAGAGGGACAGAGTGCATTCCCATTTGCAAATATTGTGACGGTAACATCCGCCTCTAGTGATGATATTCACGGTGACACAATCATTGAATTG 
ATGTACAAGACAAAGGATGGCGTAAAGG<5AGTCTCAAAAATTGAGGACGAAAACATCATCAAGGTGA7VTCCAGCAGAAGAAAAGA 

AAGCCGAGAAGACCCTGTATTTTGAGATTGATTCCGATGATGAGGTGTGTGAGAGAACAGAGGAAGAATTCTTCAGGCCTACATCTGTTGTTGCTGCCCC 

GACAACACCCCTCGTACCTTCTAATGTGGAGGAAGAGGAAGAGGAAGAAGAGCAGATGGAAGAAGAGGAGGAAGAGGAAGrAGAAAGGGAAGAAGGATCT 

GATAAGGAAGATGACGGAGACGCACCAGCACAGGAAGAAATGGAGGAGGAGAAGGAAGAAGAACAACAACAACAGCCAGAAGAAGAAAGCAATGGTAATG 

AGAACCAAGAAGAAGAACAACAACAACAACAACAACCAGAAAGAGAAGAGGAGAATAAGGATGCAGATAGTGACAGCGACAGTGATAGCAGCAGCAGCAG 

TAGTAGCAGCAGTAGCAGCAGTAGTAGTAGTAGCAGCAGTAGTAGCAGCAGCAGTAGTAGC^GTGAAAATGAAGCT 

CCTGCCAAGATTCJVGJ^GAGAAAGAGGCTAAGTGAAAGGCCATCAGAAGCTGCTTCCTC 

TATCACCATCATTGGACATACTCCAGACTGCAGTTGJVTGAGATGATGGAAGAAATTCCTGCGCCTGAGCCTATCGTTGCTACAACCTCACCCAAGGCAGC 

GACACTTGCACTCAAGACAGGATTTAGTTACTCTTCATTCGTAAGAGGAGATGACCTTTCAGTAGCTGGTAATACTTCCCCTACTGAACCAGCAGCTGTG 

CCCGCTGCTGCCACTTGCACTTCCGATGTTGGAAATGACTTTTTGGACATGTTGGACGGTTTACCTGGCGATATAGTAATGCAACCTGGCGAATGCGACG 

TGACCGCAAAATTCT^GAGGGCATCACCCTACCAGATGGTACTGATAATGAATGCACAGGTTTCGATGATCTTCTTAAAGCCACCGAGACTGATAACAT 

TATAACCACCACATGCTTTACCTCCCCGATTCACCCTTCTAGCAACTCAGCCCCCAGAAAGGATATTGATAATTGCAGTTCTATTAAGAGGTCTAGGGCA 

GGTTCACTTTTTGACACTGATGATGATAGTGAAACAAATGAGGTTGAAAAGGAAGCCCCTAAACGTAAGAAGCACTTGAAAAAGAGGCGTAACAAGTCCC 

ACCGTGGTTCCTCTGGTTCTGCTTCTTCTTCTCATTGTATGAGTAGTGATGAAGAATCAGAGGATGAAAGGGATATGAAATCAACATCAAAGGTTCACAA 

GTCACCAAAAGCTCATCTTAAACATTCCCCTAAATATGATGCTGTAAATAGTGATGTAAATAACTCATACAACAATGTTAATAGTACAACATGCATG 

TCATCAGATAGTGATGCAGAAGCACAGCCTAAAAGCXATAATAAAAGCCACTCTCGTAAACACTCTTCTTCCTCCACAAGTGATAAGAAACAGAACC 

AATGCTCAATCAATACTCAAAATGTCAAGAAGACTGTTGTACAGTCTCCAC^ 

GTCACGCAAGCACACAAAGCCAGTTAG<^CCTATAACAAGAAGCGTGATAATGTTAACACCACTAATAATGTAGTACAGAGGTCTGCCTGACCGACTCAA 

ATGATACTCAATCAATGTACAATAATAATCTTAGTACTTAACAAGAAGAACTATATTTTTATAATATTTTTACATGTCTTAATAACAAACAAAATAAAAG 

AAAACCAATGTATTATATGTTTAAAATCAACCCCATTTGCATGATTAAACTAACTATAGTGTGTAAGGAAAGAAAAAAAACATGATTATTTCTGCCATTA 

AAACAACAACAAAAATTCTAAGCTTCTCCTTTCTTTCTGTGTCTTTGCAGATTCGGGAGTCTTTCTAAAACCACAGCAATACAAAAAACAACACAAAT 

TCTACTGCTCTCCCTCACCAATCCCCCTATCGCCCTTATGCTCCTTCTCCTCTCTATACTTCCTCCCCATTCACTTCTGACTATCAATTTTCTGATTCAT 

TTCAACAACAATGCCACCAATACAATTACTACCCCACACCCCCTCCATCTATGTCCTCGACGACCTCTCCATCGTCTTTGTGGGATGATGACGATGATGA 

TGACGAAGAAGACGAAAAAGATGTCAAGCAAGAAGTCTCGAACCGTCCCCCCATTTTTTCGTACATGGAA^ 

GATAACAAGGGAGAAGAAGAATGTTTTGGATCAAACTTTGATATGTTTGGTGATTCAGATAACATGCCATCAACTTCTACTGCCCCTTTCCCTCCTCCCT 
CTACAACAACACCACTTCCTACTCCTCGATCCATCATGGATACTGATTCGGATGAATGTGACGAAGAAGGAGCAGCAGCAGCATCAGCACCGTCTATTGC
CGCCTCTTCTTCTATCCCTGTCGGGATCTCTGAAGCTGAATTGAAAAAAATGGAAAAGAAAAAGAGGAAGGAAATTAAGAAACTCAAAAAGATGATGAAA 
GATCCTCTCCCTCACCTATATGTAGGAGGAGAACCTCCTGTCGCAGCAGATTATAAAACAAGGGCAAACATTTCCCTTTATAAAGTTGACCCTAGTATCG 
ATATGTGCGGTGTCGCCCCTCCTCAATTTTGCGCTGAATTGCCCACCCCATCCATAGATGTGTATACTTCTTCCTATGTATTTCCTCCTCCCACACCTGC 
CATGCATAATAAGAAAGGGTCCAAGAAATGTCAATTCCTTAAGGGGAGAAAGGCTTTGAGGAAATGGATTCACGAGMTGTATGCATGGCCCCTCCCGGT 
AAAAGGGGAGGTGTATTTTTGGCTCACTTGGAACAAAGATTCTTGGCTGAACATGGAGATGAATACAAGGTCCCAAGGATGTTTGTTTCAAGAGTATTGA 
ACAAAGCTTTCCCCAATCTGATTGCTCGTGCAGACACACTGTGCAGTGATATGACATTCTATACTAACCTTTGTTGGATAGTTAATGGAGTTGTCGTATG
CTTTGATAAAGATGATGGAGGAATACATGGCGATGCGTCAGAGTATGCAACAGGAGAAAATTTTGATACTGTAGTGTTCCACAAGAGGGAAGAACAAAAG 
ACCAATGGGAGTGCCAGTAAGAAGAGGCGTCTCACGCCTGACACTAGTAATATGGGAACAAGCACTGATGTGCAAGAATTCCAAACGATGGGAACAAATA 
CTGATATGCAAGAATTCCAATCAATGGGAACAAATACCAACCCCATAGAGACTTCATCAGTGGGTGTGAATACCAACCCACTTCCCAACCCTCCCCCAAG 
ATTGGTAATTACTCCTTTAACGAATGATGTACCAGAATTGGACATGATGTGGCTTrATTCGCCTTCCAGAGGAGGTGGAAATTCTAGAATGAGTGCAAAT 
ACAGGAACATCTCCCCTGTCTAACACCCCAATTCCTACCTGCTTCACAGGAGGTGCGAATGTAGTAGTGCCTAATGGATTTGTCCCTCCCACGTTCCCCT 
TAGAATGTGACGAAGATGATCCAAGTATTCCCAATTCTTACAATTACGAAGAGGATAAAGTCTTTCATCCATTTTATGAGTATATGGCCAAATATCTATC 
CCCTCTTGTTCCATCATATAACAAGGGACAGACTTGTAATGTTGTCCAGGAGTGGTTCAAGGGATCCTTCTCTCTTGCAAAGCGTAGAGGAACAGTCCCC 
AAATTCTGTAGTAACATTTCCCACGCTTTCTTTTGTAATATGGATGTATGTACTGCCATGTGCAAATGGGCGAAGACTGTAATTAGACATGGACAATATT 
GTAATAGATGTATCGTAAGGAGGTCATGTACATCCATGCTCGCATATCACTACATTGTTTGCAGAGACGCTTCATGTGATGTTCCCAAGTGCAGGGAAAG 
GGTTCGCAACGACATGGATGACTGATTGATTGGTTGATATGTGACATTTTCTGTATATTGTGTAAATAAGATACCAATAAACTAATGTTTTATATATGAT 
TCTATTTTTTTAAAAACCTTTAAAAATATACATATAAAATGATGTATTTTTGAAACTACACTCTGGCAGAATCAGACCAGACCCCTGACCTAAAGCAGAC 
CACAGGGGAGTCTTAGAGAGGGGTGTGAATCTGGCTAGGGGTTCATCCCTCAATGTTAATCACACGCAAGTAAAAACACCACTTCCTAGAAAGGGAGGGA 
AGGGGTGACTTGGTGATATCATAACTGGGGAATTTCCTCTCCAGATATCTGGCTGTACAC^ 

TATCATAACTGGGGAATTTCCTCTCCAGATATCTGGCTGTACACGTGTGAGCGCTTCTGGGCGCGACAAGAAAAAAATTAGTGATATCATAACTGGGGAA 
TTTCCTCTCCAGATATCTGGGTAAGAAAAAAAATTAGTGATATCATAACTGGGGAATTTCCTCTCCAGATATCTGGGTAAGAAAAAAAATTAGTGATATC 
ATAACTGGGGAATTTCCTCTCCAGATATCTGGCTGTACACATGTGAGTGTTCTAATCTCATTTTTTATATAGAAAATATAACTGAATTAGCTCTCTAAAC 
TTTTCCTCATTTCTTACTCCTCTTTGTTTGTATGTCCTACTACCAGTGTGCATATAAGAACTGGAGAGATGGGAGTTTTAAATAACAACCCTTCTTAATC 
CCTTTCATGTTTATATTTAATAAATATAATCACCATGGATATTTCAAATAAGACATTATTTTTAGTAGTCGGTACCTTTTTTCTGACCACTTGTGCATCT 
TGCAGTCCAACTCAAATCGTCTGGAACCTCATGGTAGCTTCATTTGTTGGATTTCTAGGACATAAACTACTTAAAAACATTACACCTGTCAATCTGGATC 
TTGTCGGAAAAAGCTTCGTATTCTCTGCAAGTTTAACCATCTCAGAAGAAGCCCGTTTATTGAGGATTGGAAACGTATTAAGGGACTATAATGGCAACAA 
TTTTGAAGAGTATGAAGAAGAAGAAGATAGTGGTATTGAAGAATAATAATAATAATAATGGTTTGAATAAAATATAGAGATACATTTATATTGTTTTATT
TGCATTATATATAAAAAAGCACTACAAAATTTCTAC^CATACATTGAGAGAAAAAATTGATACAATTTCTTCCTTTTTTTTACTGGTATCTGATTTCTTG 
ATATTCGAGAGAGTTAGTAGTAGCAGAAGAAGTAGCACCAACTCCGGCAGCAGAAGTTGTAGGGATAGGGGCGGTGGCACCGGTGGCAGCGGCAGCGGCA 
GCAGCTTCACGGCGTTGTTGTTGTTCCAGGATAATCTTTGCGCGTTTATTCATTTCAGTGGTGCGTGTAAAGATGCCTCTGTTGTATTGGACCACCTTAT 
TCCTATATTCTCCGATGTCCTTATCGCGCTTCTCAAAAAACTTGGCCATTTCTCCACAACGTTGACGGGCTGCCCATGCAGTCATAGCCATAAAATCAAA 
TTGGTCAGAGGTGATATTATGGTTGTTTAGAAGGTCCACAGCGTAGGCCATACAGCTAGCGGCACGTTGAATATTGGAGTACTTGCTGTTTCTTGGTTTA 
ACGGTATGGATCACCTTACTTCCATCATTCAAGAATTCTGCCTCTTCTCTGTGCAGAAGAGATGTGCCTCTGCTAACTGTCCCGTCGTCGAGGACAAAAG 
CAGGTCCAGGTATGGGTGTTTCGCAATCTCCGTAAGAAGTGTCAGCAACCGTGAAGTGGGAAGGATATAGTCCATTATCTCCAACTCCAGAAGTGAGTGC 
AGCGCTATAAACTTCTCCGGTGATTTGGCGAGTGACTGGATTACTTAGCCCTGCGTCAATTTCAGACATATTTCCTTCTTCGGGTGTGGGAAGGAAGGGC 
CTAGGTGGGGOSAGAGTGTCGTTGTTTCTTTCAGCCGTAGTAGTAGCAAGAGATGTGACCCAAGAATATTTCTTGTTCTTAGACTGTCTTCCAGAACCGT 
CGTCGATAACAAGAGATGGTTCGTCCTCGTCACTGCTACGGTTGCCATTCTGGTTCTTCTTCCTCTTTGAATCCTTCTTCTTAGACTTGGACTTGTTCTT 
GGGGCCAGGAACAGGTATAACATCCACATTGGGTGTAGAAGACATCCTCGTTCCAGTAGATTCAGTCAGAGATGTAAGTGCTGGCACGGGAGGCATGGCC 
GTAACAGAGATACCTTGTGGTACTCGGAACATTGCAGAAGAAGACGTAGAATTAGAGTTGATACTTGTAACAGCAGAAGCAGAAGTCCGAGAGGCGATGG 
CAGCAGCTCTGGCGGCAGCAATCATAGGGGCTGTAGrGGTGCAAATCGGCAGAGTGTTGGATGAAGTCGTGGCAGCAGAGGTAGAGTTGATGTGAGACAT 
CTTGATTTCTTGGTTGGTTGGTTGGTTGTGGACTATTGTGTTCTCAGAGAGGACTTGTAGTCGACTGTGCTCTACCCAGATCCACAGCCCCTTTCATATC 
CGATTTTCGGGGGTTGCCAGATACTAATATACTCCTCCACGCCCTCTTCTTGCGCCATATCTATAAGGGTCGAAAGTTGTATACCTTGAGTTGATCCGGT 
TATAACCGCCATTGTTGGGTCTTTCTGAGAATCGAGAGTTGCGATCACGGCATGGCCTCGTGGGGGAATTTTCATTGACGTAAGCAGTAACGTCAGAAGA 
ATTGTTGGAGCGGGGTTGGTTAAAGTGCCTCTGGTGTTGGTGGCGGTGTTGGTGGTGGCGGTGGTGGTGAGCAGCAGCAGGGGCGTCGTAGTGTTCGTAT
TGCTCACAATAATAGTCCTTGGGCACTTGCATTCTTCTTGGATGAATATATCTTTCTCTTTCTTCGTATCTGGCTTCGTCCTCTCTTTTTATGTGTTCAG 
TTTTTATGTGAGTTACTCTGTCCATCTCCATTTTCTCAGAATTTCTTCCCCCTCCATTGACGTCATCCGAATAACTCAACTGGCGCCGGTGATACTGATG 
ACGTCTTATTCTTCGGACCTTATTATCCCCTCCTTCTGAATCATCAGATGATTCAGAGTCAGATGAATCAGACGAATCGGATGAGTCAGAGGAAGAAGAA 
GAAGAATCGGATGAGTCAGAAGAGTCCATGGAGCCACTCCCAATTCCACTACTAAATTCAGCTTCCAATTCTCTGACGACATCTATATTTTTAGTATCTT 
TTTCTTCCACATCTTGATTCGTATTAGGGTATGTCACACCAGTTTCGTTGCTTTGTTCTGCGAGGATATTTTCGAAACCTTCAAGAAAAGGAATGTCAAT 
AAAACCTTCCATTTCTACGATCTCCCCTTCTTCCAATTCTTCCTTTTTGTTCTCTTCTTGTTGGTTTCTTGAGACGGTGGTTTCAGTAAAATTTGGGTTA/ 

tctgtagatccaaatccaccagttccacgcacgcnrrcttgcttgatttttgttgtcctttttagaactagagtcaataatcgttctctctcccgtggttt; 
cattaatatacacaatctgttcttcctcgacatcacaatatcttaagaaaatcaactgggcaa 

actatggttgcgcagaatcactttcaattctcccctataatccacatcaatcgttccagtaggtacactagtgttgttcttccatgtcatcccagaacgt
gacacaatttgtccataacagccgtcgggaaacttgtctattccatatccagtagagatcttggccagtcccataggttcgatatccatttcttcagag; 
gaaataggtcgtaggcgacagaaccgggcgtggcacgtctgggcggaagtgcagtttcctccccgggaggggcgaatctcataaacacgacagatgcag 
tgagtccatggctggatacagggagtacacagtggagacgggatgtaataagtgttcaacgtctggaggacaagtgagtaccttcctcggttcctcccf 

CCTTTTATAATCGTGAAACCCCCACACCCCCGCTAAGGCAGTAAAAAAAATTGACCCACCATCACCCCTTGTCCTTGAAG<3AGGAAACTAGAAGATAA/ 
TTTATCATGGCAACATTTACTGAACAGGATCATAAAAATGCGTTTTTATATGCTAATGAGAAGCTGAG<;CAGGAAAGAATATACAGACTTAAAATGTC^ 
AGCCTTCAGTTTATGCTTTTATTGACATAAAAGAAATAGAAAATGGTTGGGAAAAAGAATTCGGGCTTTTAG^ 

CAGGGATATTTCTTATGACTCAAGCAAACTTGATTGTGACGCATTTTCTTGCATACCTTCAGATATACTTCATTCTGATAATGAAAAAAGAGTAGGAiG 

TGCAACTTTGCCGAACACACCTCTGTCTCATTTCCTGTCAAGAACCCTGAGGGAAAAACATTGCGCCATTTCACGGCATGTGGTCCAGGGTGTTACC5A 

GGTACAAGCAAAGAGACCCCCATACTGGTTTGCCAGTAGCCAGAGGCGTCCTGATGCAAGATCACGTTGACCATGAAACTGGAAATAAAATGTGTGArA 

TCTGAACCAGAGCTTAGTCATGTGGGCTGCTGTCCCTTGGATACGACCTGGAGACCTTACTGAAGGTTACAACACAACACACGTGCCTGGCTTTGC^TC 

AAGGAGG ATG ACGAAAGGGATTCAAAAAGAGTTAAAT ATG AAAATGTGGTCATTTCAAAGGCTT ATTGT GATTTCTTTAAAC AGT ATT ATG ACGC^ 

CTGGCTCATGTTATCGATCTGGATGGATGAAATTTGTCCATTTAATGTTTGGGCAGTATTTTACTAATCTTTCTTACAATTTAGCTAACCCAAAACff'TA 

CAATTTAACTGGAAATACATGGTCTGATGTAGTTTCTGTATTGACAGATAACCCTATTGTAGATGCGGGTGCTGCTCCGTCCAGATCTGAAATGGAGAA 

ATTATCACCAAGAAGAAGTT<^^CGTATTCCCTTCTGAGCAGACATCTGCTCGCCAGAAAGCAGAGAATATAATACGTTCTCAGTACGGAGATGG^TAG 

AAATAGACCCATCTTCTGTGGATGCTTTAATGCAGTTTGTTAATAGGGAAGGTGTAGTAGGAACAGAGAAAAAATCCGACCGCCTCATGCGAGTGCAGA 

CGCTGTTATGGATGCGGCTATGCGCCTTCAAGTCATG<^TCTGGACGACAGTCAATCTAGACGATTATTGTTAAAAAATATGATTAAAATGAGTAAAAC 

AACCCAGAATATGCAAGACATTTTTCCAGTTCTCTCAAATTAATTGGAGTAACTTTGGCCATAAAAAGATCTGTTTTCTCTAAAGGCGCTTCAGCAAAA

GAAAGGAAACGGCCATTAATAATGGCGAACAGCATAGAAGAAGCAGATGGTCACCCGAGACGGTGACAGAAGAAGATGCACTTTTATTTGCAAGfiAAAA 

TATCACTGAAGACCCAAAACACCCTGCGCCCTTTGTGGACATTTTACACTCACCAGATATCAATTCATCTATCAAGAGCGGTTCTTCTTCCTCATATGG 

AACGATATACTTTCAAGAATTTCTTCGACCAGAAAACTGCAAGAAAAGGCGAGTGTTTTCGTTAAAAATCTTGTGGTGAAAGTAGTGAGACAGT 

ACATTTTAGAAGGTAAACTGTTTTCGGACGGATACGAATGGGACGATAATATTCCTCTGATGATAGGCGTGGACCAAATATTGAGAGAAGTCATTAAGGC

GGCCAGTAATATGTGCGCTAGATTTGCCTCCTCAGCTCTGGAGTCTAGTTTGGTTACTGGCTTTATAGACTCGGCGAGTGCTATCACTTCTAGTTAGCC 

GT AC AACTGGC AGCC AG AAC GT TCT CC GT ATTCTTGX^GG AGTC TGTT AT AG AATTTGT AGTTGCC GC AAGCCT TCGGCT AGCG ATAC AGGC JTTTGCC G 

ATCTGGCAACTCTTGCCGCGTCAGCCTTAACCGTTATTGGAATTGTTATATTCGTAATACAAGTACTTGGGTTGAT^ 

TTGGTACGATCACATTTTTAGCCCAGAGGATTTAAAGAAGCAGGTTCTAGTGTTTAGGAGAGAGTTTGCAAAGGCAGGAAATGTAGATGTGGiTGTAGCT 
CAACCAGTCACCCCTGAAGAAATCGTCGCTATCAACGTTTTCCTTCAAACTGAAGAAAACGGGGAAGAAAAGAAGG^ 

TTGATTTTCTTCAAAAATACTTCCATTCTACTCCTTTGATGGGAAAGAAAAGTAAGTTTGTCTACATACAAGAAGCAGCTCAAGAATACTTG^AGGAAG 
AACAATGAACGCTTTTGGGCAGCGTATAATAACAGCTGCTGATGATAGTGACACCACCACCACCACA 

AAGAAAATGAGGAGTATTATTCTAGAAACAGGTCAAACTCTTAAGGATTACTCGTCTGCTGTTAACTATAACGCCTCCCGTCTAGATTACGrGGGAGAGG 
AATGGGTAAGAAATACTGCCCTAAAAGAAGAGACAAGGAGCAACACTACTAGTGATAACCTATTCAAGAAAACTGTTTCTCTTGCTAGTA1GGCCGGCGC 
ATTTCTGGTTCTAGGAATAGGTGTATTGGTAGCGTCCCATATTACACTTTTACGTTTCACCAATATTGGTCTAGCTTTTGCGTTCGCAGGTCTCCTAGCA
TTTATTGCACTTATGAGTATATCATATATAAACATGAATGCTATGGGTGTAGrGAATTCGGACGCAATATACAGGTCTACTGCTCTAGTTGGAGATATCA 
AAACAGACCCCAGAAGAGTAGGAATGGTCCAGCGCCACGTAGGTGTCGGGGCrAAATACAACATGATTACAGATTTCGTCTCTCCAATGTTAGACGAGAT 
CGAGAGTGACTAAAAAGTGAGGGTTATAACCCCCCCCCCAACAAAGGCTTTAAATTAGACTCAAACCAAATC^ 

GAGGAAGGTCCGTGCTCAAAAGGTCCCAAAAATATTCCTCAATCAAAAAATACCATGGATTCGTTGACTAATACAGTAACACTCCTCGTGAATGACCGTC 

TTGGAAATCATCGAACAAACAAACCAATCACCGAACAAGATGTGGAAAATACTCTTAACCTTAACAGTCTAGAAAGGGCAAGTCTTTTGAAGTTATATTC

TGTTTTCATCAAAGAAATGCAGTCCTATTCTGGATGTATACCCAAGAACAAGTACACGAATGTGCAAGAAATATTCGAAGATGGACTAATTACTTTCGAA 

TGGAGAGATGGAACAAAAGTACACAGATCAGTTTCACCAAGTTCCCCTATACCTCTTTCTACAAAAAAATCGCCTCGGTCCTCACCTTCCCCTCCrCCAT 

CGATGCCTTCTATTAAAGAAGAAGAATTTGAAGAAGAATTTGAAGATGATGAAGAAATATACGAAACAGATGAAAATGTGGAAGATTTCATAAATGGTGA 

TGGAGAAGATTCAGAAGAGGAAGAAGAAGAAGATATAATTGTTGATGACGAAGAAGAAGAGAATGAAGAGGGAGAAAACAAGTACGT^ 

AATCATCTGAGGCGCCAGACTGCCGCCGCTGCCGCCGCCGCCGCAGCTGCTGCTGCTGCTGACATTGAGAAGAAGGACAAAAACCACGCAGTTAGTGCGC 

ATGACTACACACTGTCCGCCCTCCAGCAACAGCAACAAAAACTGCTCC^GCAACAGCAACAGCAACAACACCAGCAGCGCTCCTCATCTGAGAAGGTCAC 

CTCCACACCCAACAAATTCAACAAGTTTTTACTGCCGAGTAATGGCTTCTCAGAACAGACCGAGCTCTTTGTTTGTTTCGATGTGGATAAAATTGCGCAA 

TATAATGGACTCGTGGAGCTAGACATCTTACCCATTGTTGCTGAATACATTATCAACGGCCTTGGTCTGAAATGCAGTATGGAAACTCCCCCAGTGAAAC 

CGTGCAGGAGAAAGGAAGTGAAAGATGTGTGGTGTCAGCCTAAAACTAGCTTTGAAAATGATGCTGTGGAAGATAAACATCTCGCATTCGCAGAATCGCC 

TATACTTC^WVGGCCTAGAGATTTCCCTATCCCTAAAAAAATCACCGCCTATTTTTGTTTAGACGATTCTGTAGACATTAAAAACCCCTGGGGTTCGTGT 

CCGCTTTTGAAAAGTGGATCAAACTTTCGAGTGTCCGAATATTCGCGTCATTTTAATGAATTCTCAGGAGTTAAAAATGACGATGACACGTCTTCAAACA

CTTGCTTTATATACTCTCAAAAAAACCCCAACATTGAAATTGTATCAAAATTAAATATTGAATTTGAGGTAATGATGGAGGGAATTATAACCCATAGAAA 

AGATTTGTTCGAGACGGGCATTTTGAGCGATTCTTCATTAGCTACGXSCTATGGCATTTTGCCACCCAAAAGCTAGAGTTCGAAATGTTGCArTGTTTTAT 

TTTTCTGTATATTTACCATTCTCTAAAATAACTCGCAAAGAAACTATAAAATGTTCAGAGACGGATAAGGTACATATTGGTTCAGACGCGATCTTTTCTC 

CCCCAAGTGATAATCCTAATATAAGTGCTCACCAGAACAATAACAACAATAATAACAATAATACCAGTGTGAATATCGAGGACAGACCTATCCGAAATAA 

TAATATAAGCAGAAAAATGACCATCACAAACTACCAATGTATGGCATGCAAGGAAAGATGCACAAACAATTGCACTAACGGTAACTATCCCGATCGTGGT 

AACCAG^ACTTGTCACATAGTGTAAAAGGGGAAGATTTCTTTAAGATTTTAAATAATAGTAAA 

CCGCTCCTCCCTCTGGAAATTATACATCTAAGTTTTGTGATAGAAGCTCTATGTGCCATAGCTTCTTTTGTAGAGGGATCGAACCAGTGTCTACTTCTTT
CTCATCGGACAGTTTTGAAAAGACTAAACTTGTTTTGTATGGTAAAGTCGTTGACGTTATCAATAGTTA 

ATCAGGGTCTTTTTTAACTCTGAGGAAAAAGATAATAAGACTATCCCCTCTAGAGCCGAAAGTGCAAAAAATGCATTCAAGGATATACTTGTTCACGAAT 
GTAATAAAGAACGAGCTGTTTCATATTTTGAGCAAAACAAATTATCCTCTAAAGATGGGCATCTATCTAACAAGTGGTGGATCGAACTTAATGACTTGAA 
CATTATGT^TTGAGAAACACGTGGAAGATTTTTACAAGAAATGTTCTAAAGTAAATGATGCAGAATCTTTA 

TGTGATAAATACAAAACTGCCAAGAGGGCAATTATTGGAGCACAAGACCCTTCTACTTCTACTCCCTCTAAAAAGGAGAATGGTATCACTAGGATTATTA 

GTACATTATCCGAATTTCATTCAAAAGATGAAGCTACAGTAAGTGCCCTTCTCGACAAAACAATGCTCTTGGGATCGAGGACAATAATGTCTGGTGTTAG 

ATGTGTTATACGTAACAATAGTGTGTTTTCGGGCTTTGAAAATAAGAA(^CTAATAATAATTGGGAACTTGAGATTAGACACTATGTCATCTCTATGGGA 

GGTGCTGCAGTGACAAAGATTTCCGATGAAGATTTGGAACAATTCACGCCTGTAAGAGGTGCTGTCTCTGTCACTACAGCACCTAATGATAAGCTACCTG 

TAGGGGCACATCAGACATGGAAGGATGAACAAACACTAAAAACAAACACTAAACGTAATAGTCTATATGACTCTTACAATTCAAAAAGGAATAATAGGGA 

TAATAATAAAATAAAAAATCGTTCATTAAAACTATCAGATTTTAATTGGAGAACACCCAATATCTCTATTCAAGAATTTAATGCAAATAAAGATGATGTT 

AACAAGAAGAGGTACGCAGAAGTCGTGGCGTCAGCTGCTCCAAAGTCACCTTCACCAACGAGCAGCAGCAGCAGCAACAGCAACAGCAGCAGCCCTCCTC 

TTTCACCGCTCTCACCAACAGTGAAGAATAGTAATAATAAACCATTGTATATTCCTCCCCATAAAAGAATGACCACTACTGCTGTTTGACCAATTTATGG 

GTTATTTTATAATGTTGTGTGATATTTTATACAATAAAAATATATAAGAAAACAACCAAATGATTTTGTATATCCTTGTTCTGTGTTGGGCGGAAATTCC 

TCCGCGCCCATTATGATGAAAAAATTGCTAGAGGCCTTGAATCCTGGTC^CGACAAGGTTAACAACAAAAAAAATTACCTGTATGTGCAAGTCATATCCT 

CCAGTCCCACGCATGAGAACCTGTTATTGCACCACGATTCGTGTAACAAAGTCGTATCTGCACATGTGCATAAATTCCTTCACGCTAAAATGAACGACAA 

AATTGCCGGGAGCAATGGGCGTGAGAGCCGGTTCCTGGAAAACCCCACCTCCGTTCCACGCACTACCTCACTATTGCACCATTTTTTGGTGTATAAAAAG 

GCACGGAGAGGTAGCTAAAACATCCACTCGCAGAGACATCACCGTCAAGATGACGTCTCCTACTAGTGCTCTCCGCGTTGCTACTACTGCCCTGCrGTTT 

GCTGTGGTGGTTACGGCTACTGTGATTTCTGAAGCTGGGGGAACTAAGATAGACGACCGCTGCGCCTTCCCGCCATGTGACTACTTCCCGGAACCrGAAT 

GACGTGGAAGAAGATGTGTTAGAGCTGCTCGAAGGGGGAACAACAACAATAACAACGGAACCCGTGATGGAGCCAAAGTCTTTGGCGAAATCGCAAATGG 

ACTGATTGGGCGTAAGAGGCGTGAAGCAGTATCTCTAACACCAGTCCATGAAGATATGCCAGATTTCTTCCCCTACCCTCACCCTGAGCATCCCATTGGA 

GGGCGTCTACCACGTGAAGCAGCTCCTCCACTTAAAGGTGCACTTGGACGTAAAAGGCGTGAAGCAGCTCCTCCACTTAAAGGTGCGCTTGGACGTAAGA 

GGCGCGAAGCAGAATCCTTGGAGGAAGAACTTGTGTCTGCTGAAGAAGAACGTGAAAAGCGCGAAGCAGCTCCCCCACTTAAAGGTGCACTTGGACGTGA

AAAGCGCGAAGCAGCTCCCCCACTTAAAGGTGCACTTGGACGTAAGAGGCGCGAAGCAGCTCCCCCACTTAAAGGTGCGCTTGGACGTGAAAAGCGCGAA 

GCAGCTCCCCCACTTAAAGGTGCACTTGGACGTAAGAGGCGCGAAGCAGCTCCCCCACTTAAAGGTGCGCTTGGACGTAAGAGGCGCGAAGCAGAATCCT 

TGGAGGAAGAACTTGTGTCTGCTGAAGAAGAACGTGAAAAGCGCGAAGCAGCTCCCCCACTTAAAGGTGCACTTGGACGTAAAAGGCGTGAAGCAGCTCC 

TCCACTTAAAGGTGCGCTTGGACGTAAGAGGCGCGAAGCAG<:TCCCCCACTTAAAGGTGCACTTGGACGTAAGAGGCGCGAAGCAGAATCCTTGGAGGAA 

GAACTTGTGTCTGTTGAAGAAGAACGTGAAAAGCGCGAAGCAGCTCCCCCACTTAAAGGTGCTCTTGGACGTAAGAGGCGCGAAGCAGCTCCCCCACTTA 

AAGGTGCTCTTGGACGTAAGAGGCGCGAAGCAGCTCCCCCACTTAAAGGTGCTCTTGGACGTAAGAGGCGCGAAGCAGAATCCTTGGAGGAAGAACTTGT

GTCTGCTGAAGAAGAACGTGAAAAGCGCGAAGCAGCTCCCCCACTTAAAGGTGCTCTTGGACGTAAGAGGCGCGAAGCAGCTCCCCCACTTAAAGGTGCA 

CTTGGACGTAAGAGGCGCGAAGCAGCAGCAGCAGCTATGCCTCCCCCTGAAGACGATCTCGACTTCTTTTACGCACCTGTTGCTTTGCCTCTACATGGAG 

TATGGAAAGCACCAGAACCTACAGGTTAAAATGTGGCTGAACATCACATATACAAGTTAATTGGATAATTGAAACAATTTGACTTTTTAGTACTTAATGT 

TTATTAATTGAAACAATTGGACTCTTTCCTACCTAAGGTTTATTAATTGGTACATTGTTAACACTTGCTCTTTGCTCATAGCAGACATACCTGCTATTTT 

TAACGCTTTGTTTAACGTGCTTTTTACAATTGATATTGGTACATGTTTGTGACCTCTTGAATAAATTTTGTTAGCATCAATATTTATTTTATTTTTArCC 

TTTTCTTGTGTGTTAGAATTAGAAGGAGATGTGGAGGTTGTTGATGTAGTAGATGCTGTGCAGAGTTGCTCCTGGTGTTGCTCCTCTTCTTGATTCACAT 

TGTCCTCTTCAACATACTCCTCTAATCCAAAGCGCTCTCGAATTTTCTCCTTTACACCCTCACCACAATCTCCTATCGAAGACTGTGCTAATATAATTTT 

TGTAGTAGAAGTAGAGAATAATGGTACAACGTTATTGATGTTGTTACTTCTGCCCATAGCTGGTAGTATTTTTGAAGGCATGCCTTAAAAAAGGGATATA 

TCTTTGTCCTTTTCTATACAAAAGGATAATCGTTCATCCCCATGAAAAAGTGTTAGATGGGGCAATGTACACCTATAGAACAGTGTTATTTCTT 

CTTCCAGTATTTATACCAGACCCTCTTGCCCCATAGTTTCGTATCTCCATTTTACTATTCCTGTCTCTCTGGTGAACCCATTCTGAGCCAGGTGTTCTGC 

CCATAATTTCTGATGTTTTATCAACAACAATATCAGCTGAAGGGGCAAAAGGCCCCCCATCTTCCAGTGCACTATTCCCATTAGAAGGAAGTGGACTAGC 

TGGCATAGGTGACAATAATGAAGGGAGTGGTGAAAACACAGGATAATCATCCACACCCTCTGTCACTAGAGATTCTAGATCCGGGGGAGGAGGTGTAGGC 

AACATAATTTCCATCTGCATTTCTTCCCCCGACGTGGACGGTTGCTGTTGTTCTACTCCTTCTACCTCTACTTCTACTCTTCCTCCTTCTTTATTATTTT 

CTGCTTCTAGTTCCTTCATTACCATTTCCACGCCTGTTGCTAGTCTCATTTTCTTATTGTTATTTTTTATGCGTTCTTTAGCTTGCTTTGTTACTTCATA 

TGTAATATCAGATGAGTAGTATTTGCCTGGACAAATAAGGTTATCACTCATGGCTGTATAATTAGTATCAGAATTGCCATGTGCATTATCTTTTGTTATA 

AACTCTTTGGGTGGTTTATTTGCACCAGCCGTGAAAGTATTTGCCAGAAGAGCCGAGGCTGATGAAGAGTTTTGCTTAACATACTTTTCATCGATAGTTT 

TATTAAATAATGTAACTAAACTGGCCGAACACTTACTATCCCGGCAATTCGCAGATGCATT^ 

AGTTTCAATTAAAGCTGATATTTGGCCCATGATTTTAAGGCTCGCACACAGTGCCATTTCCTGTCCTCCTTTCTCTAGTTGAATTGATCCCCATTCA 

AATTTTTCTTGTAGAGTCAGGAGACTACTCTTGAGAAAAGAGTGTAAAAGTGTTTGAGTAGTTATACAATTTTTCAACGTTTCAGTGATTGGGTCGAACC 

CGAATTTGATCTGATTTCTTACACTATCCAGTAAATCCTTGTTTAATTCTTCCGTGTCAGAGGAAGGGAAAGAAATTATATCATTAGCTGGAATTGATTC

AGAAAAGTGAATATTTTTAAGTACATTTCCGTCTGTTGAAGAAGGGTGCTTTAGATCACACGTGCTAAGCATTACATCTAATGATAAGAAAGGGGTAACT 

CTTTCTCCTGTATCGAGATTTCCTACAACATTTTCTGGTGTCAAGAGATCGTCTGTTTCTGTCACTCCCACCAATCTCTTGTAATCTTCCTCAGCAACAG 

TTTGGTTGTTGTTATAGAAAGACCCATAAGTCTGAAAAATTCTCCTCGCAGTCTCATTATCTGCATCATTAGCTCCTCCTCCTCCTCCTCCTCCTGCCAC 

ATTTAAAAGATCTAGGTAGCTACTCATCTGGGAGGGAAGGGGAGGGTTAAGGGTTGTATATAAAGAGGCTTGTCTGAAGACGCGAGTGATCACAACTTCA 

ACGATGGACGTCGAGTTCGGTTTCTTCCACGGTCTGCTCTCCAAGGCCCTTCTCCCAGATGAAAAACATCAACCCGTTATAAGGCGTCTTTGTGCGGATG 

ATTCTAGAAATAAGGGAGAGGATGGCTGCTGCTCGTTCTGTGGAAGAAGAGGAACAGGAGAGAGCAATACTGCGTGCCTTGAACAACTAATAGACGTTTG 
TTCCTTTATAGGAACTGTCTCATCTATTGGTRCAATTATCAATTCTAATCTTTCAACCAGTTGTTCTAGACTACAAAAAACGTCAGACAGTTATGCGGCA
TTATCCCATTCTAGTTTTCTGGATGTGGTATATCCAAGTTTGAAGAAAACAACTGAAGACGTATTGCCTCATTCTrrACGTGCCATTTGGAATAAACAAC 
TTCCAAAGTTGTATGAAAAAACTCTTCAACCCATAGAAGAAGAGGATATTGGTTATAAGGATTATGTTGTTTCAATTGAAGACGACGACAATGTTGATGA 
TGGTGACCAACAAGAACAAATGATTATTGATGAAGAATCTTATAAAACTATTGGAGA 

TTTGGTAATGAATTTATAAGGATTCCTTTAAGAGAAACTGCGTTGCACGCACAATCTCTGAGGTACGACACTGAAGCTAAATTTGTTAACCACAAGGACT 
CTATACCTCTATTTTATGAAAACAGCACGTGCACATGTAAGGAACGTCTTATTGATTTTTCTGAGAGACAACTACAACAACTAAAACAAGATGGAATGGA 
TAAACCAACGGACAAGTAGAGAGGATCTTTTCAACACGTATACAGGGAATGCAGTAATACGTTCGGCAG 

AGCAGAAAGAAGAGGAGAAAAGGCATGGACGACTrCAGCAGCAGCAGCAGCTTCTTCTAATTTTAATAATGTACAACAAGATTATACTGATGATGATATT 
ACACAAGTGTCTATTGCAAACAGTGTTTTGAATAACCCCTTTTTAAAGAGATATGCAAAACTTATAGATAATTTAGCAATATCTTCTTTACCTCCTGATA 
TAGAGGATGATGTCATTATACACACTAGAGATGCCTCCAACTCTACAGTCAGAGTAGATGGAGCCAATATCTATTTCGCCATAATTGACGGTGATTTATG 
TGTATACCCTAAACAATATATATCTGATAAAGTGCTGTGTGGTTCTCTCAACCGGGAAAAGGCACTGTTCTATAATAGCTCCAAGAATAAGTGGACGTAT 
GGATGTAACCTAAACTTTGATATCGTTGACGCTGCCATCATGAAACACCCCGACTACAAGGAAGAGACTACATCTACAAAACATATACGTAAAATATTGG 
GTATCGGAGCATCGGAAAAACTGAACATTACCCACTATTTAAACTACTTTATCCAATAAATCACCATGTCTATATACAGCCCAGAAATGTCAGCCATGAT 
GAATGAGAATGAAGAAAAAAACGTTCTCAGTAATAGGAGGGCTGTAATATTGATCTTTTTCATAGGTGTTGTCATTTTATCACTCTTGTTCTTAATCAGA 
CTAGATATAGATATCAAATATGGGAATGTGGGCGAGAGAGCCACTATTGTAAAGAAGACTAGTCCCATTTTTAAGGAACCTATCCAGACAGCCATCAACT 
TTAACAACAAAAAGATTTCCCACATTGAAGATGGAGGAGAAGAAAGTGATGTTATAGCTTCAAGGACTGTTAGTTCTCTCCAGAGAGATCCTCGTGTACT 
TTTAACCAGTTCATTTGGATTGTATAATAGGACAGATTATATGAAAAAATAAAAGTACCAAATACCAT^^ 

CTTTTTAAATCCACAATCAGAGCTAAAACATAGTGCAGTTATACAATACAATAATACATTTCTCCCTCCAATGATACAATGTTGATGTTCTCCTCCCATA 
TACTTTTCAATCAAAGCCGAGTCTTGTTCCTTGGATAATTTATCCATGACCAGCTTAGAATTTCTCCAGTACATGGGACGAAACACTAACCTCATACTAG 
GATTCGTATCATTGTGGGAAGGGTTTATAC^ACTTCTTGATGTCTTTACATCCTCAGTAGGCATTTCACCCAATAAATTAAACACATCTCTTATAATAGA 
AACAGACTCAGAATCATACTTTAAATCTTCTATGCATTCACCTCCTATTTTAGATTTATTAAAGGGTGCAAAGAAAGGGGTTTCTTCCTTCTTCTTCTTT 
TGGACATCATTTTTACCTAATACACTTAAACCTATCTGTTCTCTTACAAAGGACTGAAAATTGATATCTATATAATGTACATCTTCAAATTTTTGGATAT 
TGCGATTAACATTTTTGTATGATTCTGGAACAGGGATCTTCTTTGCAGTCGTCATCATTCTGGAACACGTGCAGGCAGGTTCTGGGACAACACCCATGGA
TAGGGCGATAGCATTGGCCAGTTTCACTAGCCACATTTTTGTTTGTGGATTAAAAGTCTTCTTCTTTCCAAAACGAAGAAAGAAGACGCTACATACGTCT 
CCTTGCTCGTCACX;AATTACACGTATTGTGTCCCTTTTGTTTAATATTTTGTAGGCAAATTCCAGTAGTACTTCCTTTCCTTCAACAAGAGGTATTCCAA 
GTTCTTCAAAATCCTTTTCTAAATAATCCACACTCTCCTCTAAAGACAAGGAAGAGGATGTGCATTTGGGCAGCAAGATTTCTGTGGTGCACATATTATT 
ATTCAAACCTTGAGAAGAGGACATTTTTCTCTTTACTGTTCAGGACGAGGAGAATTGGGAGGAGTTATGTTAGTTAGTGATACGTCTTGCCGCTGCTGAT 
GCTGGAGGTAGAATGAAGACCACAGCGTGAGGGGCGACCATGGCTTAGATATGGCCACGCCGAGCACTGGCATGAATTGCCCCTGAGAGGGTAGCCATTT 
AGTGTTTTCACTCGGTATTTTTCCCTCTTTATTGAGAGTTGCaVAAAGTTTATTATAGTTGTCTGTGCATGTCGATACCTTGTCTTTAAAATCGTCCAAT 
ACTTTTGGAATCCTCTCCCAGTTCTTGCGTTTTATTTGCTTGATTCTGGGAGTCTTTTTTAGTTCAGGCATATACTGCAACATTGTGTATTCTAGTGCAG 
CAGACATAAAATTAGTTCTATCTGTGCATATTTTATCCTCTTCTGTTTTACCACAAAATTCACTATGTATTATGGGCATTAATTGCTGGTACACAATAGT 
ATTATCCTCTAGACCGCACATGTTCATATACACGATAACTAAAAGGTTCATGGTTTTTACACTATAAGGAACACCAGAGAATCTTTCATCGTCGATGCCT 
TGAACTGGTTCATAGAGCACCACGTTACTATTGTTCCCGTCTTCTCTC^ 

TGTACGTTATATTGTCAAGAGAAATGCATTTGTCGAATTCGTTAACAATTTCTGCCCACTTTTCTTCATCC^TCACGGGTTGTTTATTGTTAAGTTTTG 

TGATACAGAACATCTTGAATGTTCATCCTCTTCTATATTATTTTTAGCGAGCGATGACGTCATTTCGTTCGTGGCTGCAGTTACAGCTTCTAA 

GAATCTTTATACTCGAACGCTTTCCTTGCCTCATCTAACCAGTCGTCAGTTTCTAGATCTGCCTCCTCTTCCTCTTTTTCACCTTCCTCCATTTTTTCCT 

CCTCCTCCTCTACATCATCCATTTCATTGAATTTTTTAAAGAAATCTAAAAAGTCACTC^ 

AGTAGATAGACCTGGTATTGACGATACTATTCCTCCTCCTCCTCCATTTTCTTCTCCCAATGCTTCTGGA 

ATCCTTTCTTTAGTCAATATTCCCCTATCAITAATTATACCACCTTCAGTGTGGCTCTCAAAAAATCCACGAATTGATTCGAAAACTGACGATGATAATT 

TTTTAGTAACAGGTGCACTTTTTCTGGTTGCATTAACCTTACTTTTCTTACGGGTAGATTTTTGAGGGTCATTACCACTTTTACGTTTATTATTAGAAAA 

TGGGATAAGATCTTCAGTATCAGATTCTACATCCGACCCTGTATTTTTACTCTCATTATTATTAGCACCGGAAAGATGTATACTGATATCTGATAAGGTT 

GTTATTGTACCAAAAACACGTTGATTAAAGGACCCCATTGTCAAATATGACATGACATTTGCCGCTGCATTTACAGCCAGGGCTCTATGAAGTGCGTGTT 

TCAAACTAACGTCAGTTTCCAAATCAATCAAGTGTGTTGGAGGAAAACACAACTGAAGCGGGTCTCGTACAAAATCTATAGCTTGAGACAGATAGAAAAA 

TTTGATTGTCCAGAAAGGAAGTACTAGTTTCTTATCACATTTTCTACAATTACTTTTAGACAAATTCCCCATAGAAATArTACCAACACTATCATCACAG 

TCTTTATTATTCTTCCTAGCTTCTTGTTCACTCAAAAAAATCTCCCATCCGCCCACACAAGTACAGTACGTGTTTTCAAATACTGCAACTTCTTTCCTTA 

CCCATGTGTCGTACTCACAGTTACTAACAATTGCACTATTGTAGTCTGATCCTTTAGTTAGCATAATCAATAGTGTACTTTCAGGTGAAAGCATGAGCGG 

CGATTCTGCCAAATTATATATCGTATTTGGCTTTGTTCTTCTTTTAGCTGCAAAACCACCCAACTCGTTTGCTTGATTTTCACCCGAAGAATTACTGTTC 

ATGCACCTAAAGAACTTGACATGGCCAATCTTTAAGAACGTTTCAGAGTCCTTAAATTGTGTACCAGTAAGACGACACACGTTTTTATGATGTCCAACCA 

TCAGGTTCCATTTCACCAGCACATCAGAATCGTGGGAGAAAAGATTCATAGAAAGTGTTTGATTTTCATCGTTGGTGTGTTGCTTCAAAACATTCAATAA 

ATCTTCCCACCTCTTGGATTCTTTTCTGTTACCAGACATATTATAAGAATCGTACTCCAATATCTTCCTCTTTTCTAGTAGGGAGAAGTGTTTCCCAGGA 

GTCTCCAGATTCAAAAGGCATGATGTTAAACGTACGATATCATCCTCTGCTTCGGTGAATGATGCATTAGAAAGGTACATGCATTTGCTCCCCGTAGTCT

TTAGAACAGGGATCCTCATTATGTCCAGTAATGCAGATGCCATCAGACCTTGAGGAAGACGCATCATCATCCGCTCGCACGAGTAGAATAGTAGATCGTT 

GGTCTTCCTATTGAGGCAATTTTCATTCCTTGATTTTGCCTTTTCCTTACTTCTTTCTTGTCTGATCTGTCGTTTGGCTCTCGTCATCATCTTGATGGCT 

GGAGCCTTTCCATCCGTAAAGAGTACAGTATACCTCCCTTTAAGGTTTTTAGACTTGAGGAGATTTTTCATCACTCGGATCATCCTAAAAGCGTAGTACA 

AGATGACCCCCTCC^TCTGGCTAGCAAAAAATGCATCTTTGAATCTGGATTTCAAAACTGCTGTCGGGGAAGAATTGAATGAAGTAGTAGGCACGTTCCT 

GAACAAGAAACTGTTAGTCAAAAACTTCCTCTTCTTCATAACATCTGTGTGTTTGTTGATCCAGTAATCGAGACCCTCATAGTCGTCCATTTTGTGATTA 

GAAACTGGAGGAGCCATCATCATGTCTTCGATTGGAGGTTCAGTTAGCCAATCAACTTCCTCTTGGGATGAAGGGATTATAATAGGCACTCTTTCAATTC 

TAGCCTTAATTTTCTCGTGTTTATATTTTCTAGAAAAGAATTCAGAAATAACCTTCTTTTCATCCCTGCAGTAAAAATTTTCATAAATATGTTTCTTGTT 

CTGCATCCGTGAATGCATAGTATCTTCTTCCCTGATAACATTAAAACCGTCATGTAAAACGTCTATTATCTGTTGATTTGTAGTATTGATAAACAGGATC 

ATCTGACCCATATTTTTCTGTCCCTTCAACATGGTGGTTGCAATTTGCCTGCCCAACATGACCAGATCTGATACATTATAAGTGGGCAAATCAACCATGT 

CGAGTGTGTAGTCACAGGTAGGTGAAGAAGAGGAAGATTTCTTGCCTCTTCCAAAATCCAGGAAGTTCTTGGAGTTGATGTTGGCAAGAGAAGTGGCTTT 

GAGAAGTGAGCTAGTTGTGCAATCTTCTTCTCCTGAACAATAATCCTCGTCGTCTTCGTCATCGTAACAATCTTCATATCCAGTTTTCTCCTTGATGACT 

CTCCCATTCTGTGCACCGTACGCATTTTCGTACACTTTTTTGGCTTCTGGGACACCCATTTCTGTTGATGTTACTCCCACGGGATGTTAGACGATAATGA 

ATGCTCACTGATGGGCGCCGTAATTTTTACACACTCTGCGTAGTCTGGTGTTTTTACCCCTCCTAGAAGAACTTCGTGGAGCTGCTTTAAAGGTTGTAGT 

GCTTGGAATCTGCAGOU\ACGCAGCTCTCTACAGAGACTTCTCGGAGACTTGCCAGAAGCGAGTTTCCTCTTCTTAATAGGGGTACTTTCTTCCTCTTCC 

TGTTGTTGTTTGTAGAGTGTTATAGATTCAGTAGGGAAGTAGTTTCTAGATCGAAGAACAGTGTAGTCTTCTTCCTCTTTCACAGTCTCAATAGAATTGG 

ACACCTCGTTGAACACTTCTAGGGCCAATTCCCACACACCTTGTTCCTTAAGTGTCTCGACCACGTAATCAAATTCGGATTCACTCAATAACTGTAAGGA 

GGAAGAATCTCCATCCATGATACGAATCTGGAACACCTGTCTCCTCCTTGACTGTTAACTGATGTCTGACCCCTTGTAACCACTTTACTGATGTTTTATA 

CCCTCTCCCGACGGAAGATGTATACGTCACAATTTTTTTTAAAAAATAACGTTATACACCAAAAATGAGCGATACAGGGCAGATGGAAGAAAATAGGCCT 

GCTACCCAGAAAAGGAGACCTGGAGATGAAGAAGAGGAAGAAACTGGTAGTAGTAATGTTCCATATTATGCCAACTTTGGCGATGACGCCACGTACTCCA 

TGTACACTGGAGAAGGAAAAAGGGGTAAATTTGTATTAGAGCCACCTAAAGAAAGAAGTGTACAAAGGGTGCAAAAACCACCTAAAGAAAAGGAGGAAAG 

GGAACAACGTTCTAATGTTCGGACACGGAGACCTGGTCAAGAGTTTGAACAGAAAGTGCTACAAGATCGATCGCGAGAACGGTCAGAAAAACTTGGGCAA 

AATTTGGCAGAGAAAGGATTGCAAGAACGGCAAAAGAAATATACTCCAAAGGTAGCACAAACAATGACAAAAAAAATAATCAGGTTTC 

GAAAATTCAAGGCGCCGCAACAGCAGACATCTGACAAAGGTGCAGCAACCAATGTTCTTGAAAGGGAAGAAATTGAGATGGCTGCAGAAAGAGAACAACC 

AGTAGAAATTACAGGAGATACTATATTAGGTGGGCTAGGAGAAGAAGATGACGAAGATATGGGAGAGGATGAATTAACTATACAACATTCATCTATGGCT 

GTATCACAACCCGTTCAACAAATCGTTGTCAGTTCTCCTATACCGCCAAAGCCCACTAGGCCCGCTCCTGATATTCCCATACAAGAAGATATAGTGGGGA 

AAAATATTAGCCAGTTACCACCATTACCACTTGATGATTATGAGGACGAAGAAGACGAACATTTGTACGAAGAAGTGAATGATTTCTTAGTGGCACCACC 

AACAGCAGCAGCAGCAGCTTCCACAAGACCTCCCAGGCCTAATATTCCTCCTCCACCTCCTCCTGTTGTTGCTGTTGCAGACGAAACCTTGAAGAACTTG 
GCTTCAATTGCAGCCTTGGAAAAGGAGGCCGAGGAACAAAGAGCGGCCGCAGTTGAAAGGGAAAGAGAAGTAGAGGAACAAAGAGCGGCCGCTGCTGCTG
CTGCTGCTGCTGCTGCCGCCCAACGGGAAGCAGACGAAAAAAGGGAAAGAGAAGCAGAGGAACAAAGAGCGGCCGCTGCTGCTGCTGCTGCCGCTGCTGC 
CCAACGGGAAGCAGACGAAAAAAAGGAAAGAGAAGTAGAGGAACAAAGAGCGGCCGCTGCTGCTGCTGCTGCCGCTGCTGCCCAACGGGAAGCAGACGAA 
AAAAGGGAAAGAGAAGTAGAGGAACAAAGAGCGGCCGCTGCTGCTGAAAGGtSAAATACTTGCCCA^ 

AGGAAGAGGAGAGGCGGAAAGAACTAGCAGATAAGGAGGAAGAAAAACGTCGAGAACTAGCAGCCAAGGAGGAAGAAAAGCGTCAAGAAATATTAGCTAA 
AGAAGAGCAACTTGAAAAATTGAATTTCCAGTTGGGTACAGAAATCACGTCCAAAAGAGCACTC^ 

CGGTCCGCAGCCAGTGCACAGTTAGCGATCCAAGCAATAGAATATGAGGATGAACTTCCTCAGGCAGTCGAACCTCAAGGACAGTTAGTTCCTATGGATA 
CGGATTTGTACGGAAAAATGTACGATCTCAATAAGAAATTAGAAGTACAGAATAATACATTAACTTCTGCATTTGAAGACGTGAACAAAACAAACGAACA 
GAACCAATTG^TGCTCAATCCCTTGAAAAATCCGCTAAAGCCATTGAAAAATTAACTAGTCAAAAACATCTTCCTGTGGATGATCCTGCTTTTATGCAG 
AGAATAATAACAGAGAGGGATTTTTCTTTAAAGAATCTGGGAAATGTTTACAAA^ 

CGAAGGCATTAATTACAGATAAAGAATCAAGGGATCTGGAGGTGCGTCTAACAGATGTATCGACAGATCTCAGGGCTAATGATCTCAATACAATACTGGA 

AAGGTTGGATGTATCCGTTAACATACGCTCTGOTGGAACATTATACACTAAATTTACAGAGGCAGACACGGCATrAGCAGATCAAGTTCCTTCGAGGATT 

GAAATAAGTAACAGATCAAGATCTGCCTTATTGCCATTTTCATCTGCAGGTTTGGATACTAATTTTACTAATAGrTCCGACAAGTACAATGAAATAGTGA 

ACCAACTAAGCAGTATAAATGAGGCTATGAATATTTTGAAAGAAAATATTGTCCCAACATTGAACCAAATCAAAATTGATGTCACCAATCTATTAACAGT 

TTCAAGCTCTCGTCAATATGCTATTGAAGAAAGGGTGTATTCTGATGTGTCCCGAATGGATTCTGAAATAAGAAAATTCCTCGCTATAATGAACAGTAAA 

ATATCCCCTTATTTTAAAGGCGATTGGACGGATGAAAGACAACGCTCTATTGCTGACAGTATTTCCTCTCAGATAAAATCAAACGATAAAATTAAAGAGA 

GTGTTGCTACACTACACGATATCAATACAACCTCAAGAATACGTAGTAATCCCCTCCTGCACAAATCCTCAGTTTTATCATCTCCAGACTTTTTAAATGC 

TGTTAACGACTTTAGAAATTTTCTCGATATCCAAGGAGGTTCTCAATTTACTTATGATGTCCTTTCAGGCCJU^AATATTGATGACCTTTCACTGGCATCA 

AAAACCACTGAAAAGGTTACAGAATTGTGCCTCGAATTATCCATAATTTTAGACGTGATCCATAAAAATGCTTTGAGTTTAAATTTGCCTGX^TTACTT 

ACCCCGCAGGGGAAACATCTATGGAAGAAAGTGGTTCATTGGCTGTTGACATTAGACAAGAAATTGGTAAGAACATATCAGATTCTAGCGCTGAACTTAG 

TCGCACACTGTCAGAGGCGTTACAAATTTTTCAGCAACAACAGCAACAACAACAGCAACAATTCCAACAGCAGCTGTTACAACAACAACAGGACCAACAA 

AATCAACAACAATTATTACAACAACAAATAGAAGAACAACAACGGGTTCAGGAACAGCAGCAACAGC^ 

AAAGGGAACAACAGCAGCAACAGCAGCAAAGGGAACAACAACAACAAAGGGAACAACAACAGCAGCAACAGCAGCAAC^ 

ATTATTGCAAC^^CAGCAACAATTTCAGCAATTACTACAACAACAAGGAAGAAGAAGAGGGGGTGACGATGGTGATGAAGAAAGAGAGGAAAGAGAAGAA 
GGGGCTGAAAAGGATGATTGTGTGCGTAAGGTTGCAGJVATCAGTAGCGACAAAATATACTGCTGACTTC 

TCCAATCTAAAATAGCATCAGCAAAATTGGGAACCCrTGTCTTTGCCACCCCTCCTTCACCTATCATGAACTTGACAAAATTGAGAGAGGAATATTCCAC 
ATTCACAACCCAGTGTTTTTCAAAACTAACAGCTGAAAATAATAGTATTATGCGTATTTTCCCCGAGAGGATTGTAGAAGTATGCAAGAGTAAGAATCTC 
AATTTAATGGGAAAATACTTGTACATTATAACTACCGXTACAAACAGAAATGGAAGATCGAGTGAAGAAT^ 

AGTTTTCAAACAATGTAAAACAACAACAACAACAACAAGCTGCTTCAGCTTCTTCTACTAATCCTCCTCCTCCTTCTACTCCTTCTACTACTCCTCCTGT 

TACAAGCATGCAAGTTTGTGAGTTGGATGATCAACGTACCCTAGAAAAGGCTGCCATAGTAGAGGCAATTACTCTGGCCAATGCTGTACTTCAAACTACA 

AAATCCGCTTCAGCTCCTTCCACGGCGGCTGAGCGAGAAAl^GCTCTAAAGCTAGAGAATGGGAAAACATCTATCCGTATGGAAAAAGTGGATCTAAGTT 

CAGGAGCTACTGGTGTTTCCGACCAACAAAAATGGATCGACGAAAGTACTTCCAAACAAGAATTGGAAGATTTCATTGCAGAAGAAAA 

TGCACATAATGAAATGGATATTGGATTAATTTTGGATGCCAAGAAGAACGATCCGACCCGT^ 

GTGCAGTCTTTCCCArATTACGTACTCCGCACATGGCTAGGAGAAACAGATATATTGGATGAAGATACTGTACATCCTGAATATTTCCGCCAATACATTG 

ATCGCAATTGGAAGGrGGAAGAACATGAGCGTGAAGATACATTAAAGGCACTGGGTGTTTCTTTATCAGATACGTTGGCACACATCAAGGACTACTATTC 

TCCCAGTGTCAAAAATGATGCATCAAAATCAGTACCATTTGCGTTGAACACTCTATTGTACAACATATTTGCTATCGACGGAGGAATGATTTCTAGCCTT 

TCAAGAACAGCCTTTATTTACCGTAAATTTTTAAGGCAATCTATGACAGATAAAGAGGTTGCTCAAGGACCTGTTCGGTCTCAACTGTGTGAAGCGACAA 

TAGCGTCTCTTTTCACGGCATGTAGTAACCTTCTTCGATCGTCTCCCTTAGCCGATAAAGTAGAACCACGCCTTCAAGAAAAATTAGCGGCCGCTGCTGC 

TGTAGACACGTCAACCGGAGACATGTTTCGTATACGAGTTTGTCATCTCATGTACAATTTTATAGTGGCTTATGTGAACCTATGCAATAATCGTATAAAC 

TATACGTTAAATGTATTGAGAGCGTCGGGTCTGGCAAACAAAAAAGTCGTGGX:CGGTAAAACAACTAAAGGGCATACATCTTCTTCCCACCGGTTTGGAT 

CT7ATGATGTCACATATGATTTTTCAGTATTGTACAAGATTCTTCAACTACAAAAACAGAATATCTCTTTACTTTTAGAGAAGCGTTTCAATGCATGGGA 

ATCGTGTGTTGCCGCCATGGCAGCCTTCACTGCCGACCCTTCTCTCTCCATCTCAGATGCCGACCAATCCATACTCTTCCCACTAGAAGGGGGAGAGATT 

GTTATTGAAAAACATGAAAACGATGCAGAAAAAAATGTCGATATGGTTCAAGAATTGTGGAAGGAAACTGCACTCACACTCATGGCAAAGGAACTAAATT 

CATACTACAACTGGTTCATATACAGTAAGGATACCGATATGGAAAAGTTGGCGAGAGTGTGCAGGATGATTATCGGTATAGTTAAGGCTGTTCTCAGATT 

AACAAATAAAGCCGAGAGTTTAGTAGACACTAATGCTCTTTCTGATATTTTCAAGCTACCTGTGATTCCAATTGATGATACCAAAACTCTGGCAATAAAT 

ATTGTGGTGTTCACTTTGAATAATGTAATTAAACCATGGATGGTTTCGTTCAAGCAAATGTTTAGACAGAAAGACGGGGGAGTTTCTTCAGCCTATTTCT 

CGTTTCAAAATATCCAACAACAAAAACATCAACAAACTGCTTCCATTTTAGATGCATGGGCTTGCGCGCCCGGTAAGTTAACAAAGGCAGCACACGTGTT 

TATTTCAGGATATGAAAATCATATCAAATTAAAAAAAGATGATCTTTTATGGGGTGCTTCAATGAAATTCCCTGCCGACGGACGAGGCACTGTGGTCG 

GGATGGGCCCAACAATACAATAACGAAAGCGTATTGGAAGATTTTACCGACTTTTCAA 

ATCCTTTACTTTCTTCCATGTTCGGTAAAGGAAATGGTGGAAGCAGCAGCAGCAGCAGCAAGGAT 

CCGCCAAGTTGTTGGACAAGAACAAGCACCACCAATAAACACGTCGTCGGATACTAAAAAAATAAGACGTGATGCAAATATTGAACCAATAATAGGGACG 
CCTTATAGTGTAATTAAGGCAAGTAAAGGAGTATCAATTTCAGTACTGGATGATTTCAATGAGGATAGTCCAGAAGArPTCGCCCTTAAAACTTCCATCA 
TCAATGATGCCATACGAGAAATAGGGCAACGCATGACTTATACAAGACCTATATTTGATCATCAAACACAGAAAAATATACACTACAGTTCACCTAAAAT 
TATTCTCGAAGGATCAGATTTAAAGAATGGACAACGTTCAGGACAATCTTGGGCTCCTTCTTCATCATCCTTGACTCTGGCCTCCGATTGGAATCTACCT 
TCTCTGGAGCTTTTATATAGAGAACTTGCCACAAAACAAGTAGAGAAGGAAGAAGAAGAAAAGAGCGAA^ 

AAAAATTATCATTTGTCGTGAATAAAGCTATCGGAACTATCCAACAACAACATCAATATTCTGAAAGGGGAGGAGGAATGAAGAGGTATCAGCAACACTC 

TGCTGATCAAGCTAGTAATGGTGGCATAGATGATATAGAACTTATGAATAGTAAAGATGCTACTTCCATGAGAAAGGCAAAACTGGCATTAGCCGTTACT 

AATAAAATTGCAGCAGCAGCAGCAAGGGATGGGGAAAATTCATCAGCTAAACCGTCAAACTTTGGCAATAGATTGGATGAAGCAATAAACCCTG^ 

TTTTATTACGTAGAGGAGGAGGAGTAAGAGGAGGACAAACACCCCAGAGTTCAATGCTAACAATGTTCCGTCCTGGACyVAACTGGTGGCAATAGTAGTTG 

GTGGACTACTAATACACCCCrTATTCAACGCACAACTAGTGTTGGGAATAATTTAGTTGTGCTTGTACCCAACCTATTGGATTCACACCCTCCTACATTT 

AATTAGGAATAAGGAATTATACACATTTATTTGTATGTTTTACATGTATTTTATTCAACAATAAAGGTTATGATCATACATAATCAACTGGTTGTTTTAT 

TACATAGTTTCCATTACTGACCCTTCCAAAAATGAATTTTGTACCTCCGAATACAATTTCTTCCCTCGGTTTACAAAGGAAATTGCACGGAGCGAATCTG 

CTCACCCATTCACCAACAACACCCTCATTAATCTTCCCATGAGTGATGTGAGGATTCAGTCTGTTGTCAGCGAGTTCAACATTTCTCCCTATTGCATCTG 

AAATGTGTTCCATTATTCGTCTACGCAAATCGAGTGCATCATCTCCAACACATACTTTGGATACGAGAATGTTCTTGTTCATCCCAAACAACTCAA 

ATCCCCAGTGTCGATTTTGATAGCTGGTGCAAAACAACTCCCAAAATCTGTTTTC 

ATGGTCACATGTGGAGGCAACTGAGAAAATGATTTGGAGACGTGTTTCTCAGATAATAATTTTCCCGTTTCTGAATAGAAAGGAAGTAACTCTTCGCTGT 

AGAGAGCAAAGAACATTTTCATTTCTTCACCTTTAAAATCAACTTGATTTATAGTGAAGATTCTATTGTTGTGTTTGATTGAAAATTGTTTCACCACTTC 

AAACATGCCATTGATTTGTTTGTACATTTGCTTGACTGCGCATCTTCTTTGCACCTGTGGTTGAGTAGATGAAGGGGAGATTTGGTGGCCATTATCTGGG 

TGGTATCCATTCAATATGTTGTAGCCGGCACTUiGGGATAACTGGGTCCACAAACTTGAAAGCCTCTCTATTAAACCCTCTCCCTAGAGGAATAAGTC 

AACGAATTTCGTCTGCGTCTTCTGCCTTGTCCATTAACATCATATTTAGATGCACAACATATCTCAGAATCCCATTCAAGGTGTGTTTTTCTGGATAGAG 

GCAAATAACATCATCCACATATTCCCCATCTACATAAAGTCCAGTAACAACAATCTTTAACGCTTTAGATAAAACAATAGTTTCTCTATGAGAAAATATT 

GCCTTGTTTACTTTGGTGGAGAGTTGTTCGTCACTGGGAATGGGCGTAGAGGCCAAGAGTAGAGCCTTGGAGGAAAATTTCTTGTCCATAGGAAGTCTGT 

GCCGGATATTGTCTACCATGTTAGACATAGAAGTATTATCACTAAAGTCAAAGAAGACTACTCCATTTTCTGGAGAGGGTTTGTTGAAGGATAGATGTTC 

TTCCATGGTGGTG<n , GGTGGTTGTGATGGTTTGAATGAGGTGTGGTACAGTCATTATATACACGTGGGCCCTTTTTAAAAGGGGGAGGGAGGGGGAGAGG 

GTTGCCATCAAGCAATATCTTCGTCCAGAAACACCTGGTCCAGAAATGGCCATAAGATACTTCCTCTATTTCTGGAGCTATTACATTTCTGGTGTAATCT 

GACATTGGCCCGACCCAGAGGTCCACCCTCGGAACTTGACATTCGGTCGAGCTAGCGGTCCACCCCCTAAACTGGAGCGCCCTAAAAAATTTTTGAAAAG 

TTTTTGAGATGGAGGAAGAGTAAAATTCTCTAGTGAAAACAGAAGGGTATACCCTCTCATTTCTGGGTCGACCAGTCTCCAGAAACGCCTGTTCCAGAAA 

CACACAAAAGTTAATGTACGTTTCTGGAGCTATTACATTTCTGGTGTAATCTGACATTGGCCCGACCCAGCGGTCCACCCTCGGAACTTGACATTCGGTC
GAGCTAGCGGTCCACCCCCTAAACTGGAGTGCCCTGAAAAAATTTTTGAAAAGTTTTTGAGATAGAGGAAGAGTAAAATTCTCTAGTGAAAACAGAAGGT
TATACCCTCTCATTTCTGGGTCGACCAGTCTCCAGAAACGCCTGTTCCAGAAACACACAAAAGTTAATGTACGTTTCTGGAACCAACAATTTCTGGTGTA 
ATCTGACATTGGCCCGACCCAGAGGTCCACCCTCGGAACTTGACATTCGGTCGAGCTAGCGGTCCACCCCCTAAACTCGAGTGCCCTGAAAAAATTTCTG 
AAAAGTTTTTGAGATTGAAGAGGAGTAAAAAACTCACTATATGAAAGGTGTGTAGAACACCACATCCTTTCTAGGCACGGCCATGTCCAGAAACGCCTGT 
TCCAGAAACACGCAAAAGTTAATGTACGTTTCTGGAGCTATTACATTTCTGGTGTAATCTGACATTGGCCCGACCCAGAGGTCCACCCTCCGAACTTGAC 
ATTCGGTCGAGCTAGCGGTCCACCCCCTAAACTCGAGCGACCCTAAAAAATTTTTGAAAAGTTTTTGAGATGGAGTAAGAGTAAAATTCTCTAGTGAAAA 
CAGAAGGTTATACCCCACCTTTTCTGGATGCGACTAGATCCAGAAACGTATGTTCCAGAAATACCCATAAGTCCTTTCCTCTATTTCTGGAACCAACCAT 
TTCTGGTGTAATCTGACATTGGCCCGACCCAGAGGTCCACCCTCGGAACTTGACATTCGGTCGAGCTAGCGGTCCACCCCCTAAACTCGAGCGACCCTAA 
AAAATTTTTGAAAAGTTTTTGAGATGGAGGAAGAGTAAAATTTCTTGCCTGAAAGGTGTCCAGAAGTGTAAACACACACACGCATTCTCGCACCTTGTCA 
CCATCATTAATAAAAAATGTCCCCTCCCGCTCCCCCCTCAAAATCGGGTATAAATAGAGCTCACCAGCACACCACAGACATCATTCTCAAGACATTTCAA 
GTACTGAGAACATCCTCTCCTCTTGTGATATTATTCAAGAAAACCTACTGAAATCGTCCCTAAAATGGATTTTGAAGGAACTACCAGTTCTACCCCCTCA 
AAAATGTCCCAGTTGTATTCATCAGTGAAGJUIAGTTGCAGAGCATTCCTTTGCCAATCTTCATGACAAGGCTACTCTTGCATCAAAGGTTATTAAGGACC 
TGGAAGGGGAGAGGAAGAAAATGTCTACCCCAAAGTCCTCTTCTGATGGACAAAAACTGGACAAGGCTATGTTGGACGATATTATCAACGAGTATCAGGC 
CGTTAAGAGCACTGCAGATAATTCCATTGAATCGACCATCAAGGAAATTGAAAATGTACTTGAAAGTGTGCGCAGAACCAAGATTGAAAGTGAAGCCAAG 
AACAGTGTAACTTCCAGCCCAGAAAAAGTGTTTTCTGTCGAGGATTTAGAAATCTACTCCAAGGGGCGAGTGTGCAAAGGTCTCAAGTTAAACGCCAACT 
GTTCAAGAATTGGAGGCAAGTATGCAGTGTCAATGAGTATCAAAAAACACAACGTCTCCTCATTTGAGAACAACAACAACCAAGTTTTCTCTGAAGAACC 
CAGGGATTGTTTTATGCTTGAAACAACCTATCCTCTTGTTGGGTTCGAAACTTCTACAGAAGATGGAAATACATATGCAGTTTTCTTGACTGGTGTTGGG 
CTAGAAAGATCTCTACCTAAATATGTACCAGTTTTCGACATGAATGCAGGTATTCAAACCCTAAACATGACTGGTTTGAGGATGGCCAAGCTTCCTGTTC 
TGTGCATGTTTGGACGTACAGAATATGACAACTTGGAAGATTTTTACATCACTTCAATTGAGACGCAGTCTTTTGACGAAGAGGAAAATGATGCCAGAAT 
GAGGTGTCACACCGAAGATTTGGAGAGGAAGAAGCGCATGAATGACGCACCAGCGATTACACCTCATGTGGCCGTGTACGACTACAGTGGAGACGGGAAA 
GAACAATTGCTCTATATGATAACCGAGTATGAAAACACGGCTAGTTGGTGCAACGCAAACGGTGTGGTCACATCTGACAGTGGATTTTCTAACGAATGTG 
CAATTAGTGATATGAATGACTTGTGCTGTTTTGCTGACTGCATCGATGTTACTGTTAATAATGAAGAACATGAAGAACGTTCTATGAATATTGTGGTCGA 
ATCTGACAGGCGTCTTTTTGATGCTAGTCCTTCCCCCATCAAGACGGAAGAAGATGGAGAAAATTCATCGTCATCGTCTTCCTCTCCAACAGTTCCTCCT 
CCTACACCATACGAAGGTAACGCAGTTGTGGAGGGGGAGGAAGAAGAGGAAGAAATTGATGAAGACGAAAGTAGCAAGTATGAAGGTTCAGAAGATGCTC 
TTGTTATGAAGAAATTAGCCAAGCTTTCTACTATGAAACAAATGAGAAGGGTTAAGAATGAACCTGCACTCAAAATTACTTCTGGGGGTAACAATAGTAG 
CAGTAGTATCAATAACGAAGATGATGGTGATGATGACGATGCCGTTGACGCTACTGCATTATGCCCCCAAACTGAAGCTACAGTGAAAAATTCCTTCATG 
GCCCCAAACGACGAGAGAACTGAAAATATTTTGTATGAAACTATGCAAATTTCTCTTGCTAAAATTTGTAATAATCCATCATCTATGAGCAGrTACCGT'G 
TATTCACCAACAAACTCCAAGAGTGTTTGAATACCATGGACGATAGTATCCGTCGCCGTCCAACCATTTGGACTGAAGAAAGTCAACAATTTGCTAAGGG 
TTTGTTGTTTGATGAGGTTGTCACATCAATTGTGGCACATCAGATGGCTCAAGATATTTGCAAGTCTGAAATATTTGGAGGAATGTTTAACGCCAACTCT 
ACCAACATTAAGGGTAAATATGAAGGACAAAAGAAGAGTCTGTATGGAAAC^AGCACATTTCTTCCTCGTGCTTCAAAACCAACACGGAATCTAATG^ 
ATAATGCACTATTTGCGTGGGTGAAATCGAAACTCCATTCTGGCACAGTCATACCTAACGTATTCTCCTTCAAAATGGCATCAGAAAAGCCCTCAAAAAT 
GAAGCGCAAGCGTACCTCTAGTGCTTCATCATCTAACGATGAACACCAAGAACCATCAACAAAAATGATGAAAAATGATGAAGGGGAAAAGGTrGCACAA
GAATCATCATCTCCTTCTTCATCGTCTACACCAGAACAACAACAACAAGCTGGTCATGACAAGGAAACTATCAATTTAATTCCCCTCAGTTTCATAAAAA
TGCCACGCAGTAATGTCAATGGCTCGGCTTCATATTTGTCTGAAATATTCGGTCAACGTCTTTGTGGACTCT

GTGCAAGACTTTTGAAGATCTTGAAAATGAAATCATGAGGAGCTCATTCACTAGACTGACTAGATATGAGAGGGAGGTAACTCGCTTGTATGAG 

AGGTCTCAAGCTGTAGATATTGAGGAAAATGAAATGGATGTTTTGTCTCACCAAGGGGAATTGTTTGCCGAGTTCTTGGAGGACCCGATCGCTTACTTTG 

AAGAAGTACTGGAGAATATTAAGAGTTGGAGCCTAGAAAACGTTAACACCCCTAAGCGCAAAAACAAGTATGCAAAGGTACTGGTGAGCGTTAATGCTAT 

TCGTAGGACATATGAAGAATACCATGCGTTTAGCAAGTTTGTACCAATGTTCTTGTTCAATCTGATTAAGAGAGAATTGGAAGGAGACAACTATACCCAT 

GACGTTCACTTTTCrTCCACTTGCCTGTGGTACCTGACTGTAATGACCAGGAACAGGATTTGCGATGTGCTCCAGTACATCAACAACAACAATAATGATA 

ACGAAGAAACCGATATrGTGGAGGAAGAGGAGGAAGGAGAAGGAGAGGAGGATAAAATGGAAGAAAGTATGGACGTAGAACAACAGAAGCAAGTTCGCAA 

GGGAGGGAGAAAGAAGGGTCAAAAATTCAACAGTATTGGGGATCAAGTCATTAGAAAATTTGTGAAAAGTTTGTGTGAAAATTCGATGGTAGTTTCTATT 

GCAATTAATAGTTTGATCTCTGGAATAAGCTGGATGAACAAGAAAATCCCTCCCGGTTTCTTGAAGGATTCTAGCACAATCAATACCCTTGATGAGGTCT 

CAAGGTTTGTGTTTAGCGATGTCAAAATCAATAGGAAAATCAATGGAACAGATGATAAATATGAAACTGTTTTTGGAGTCAGTACGCGTGTGGATTCACA 

TATTGTAGGCCCCTTTAGTATACCTGTTGATTTTTCAAGCGCAGGACTAGATAAGGCCTCATGTGGCAAATTGTACGTTAACACCATAGACGGAAAGGGC 

ATTTTGACAATTTCACCCAAATATGATTCATTAAACGATGAGGATGTTGATTCTACTACAACAGACAAGCTAGAGAAGGATATTTTGCATTTGTCTAAGC 

ATGACACCTTTTTCAATATTAATAAGAATAAGGTTCTTCCATTCTATAATATTTCTCCTAGCTCTTCTCTCACTGAAAAGAAAAAGACAAAATTCAATAG 

GAAGAAGATCTCATCTGGTATGAGCAATAATAATGGCATGTGTGTACAAACTCCTTCTAGTTCAAATTCAGTCTCTTCCGTCTCGTCTATTGTAGCTCCT 

TCATCTTCTGTTCTGGCTCTATCTTGCTCCCTTTCTTCTACAAAGAAAAAGAGCATCTGGAACGAGAACATGTTTTTGACATCTAGGAACATGTGGAGGT 

GTGGATTTGTTGTACCACCCAAACTTTGCAGTTTTATTGTTAACCATAGACACGCTGTAAAACTTGTAGCTGAAACTGCACCTAAAACAAAGTTGTGTAG 

GAATATTATTGATAGGAATAGGAAGATTAGATTTAACGGTCTAAAGAAGGTATGCAAGAGTGTTAGCGCCTTTACCGGCGAGTCTACATATTTGCTCAAT 

AAGAATATGACTGCAACTTCACCTAGTGATTTGAACCTATGTATTTATACTTCATCTTTAAATGACCCATTGTATACTTGTAAATTGACCCATGAAGAGT 

ACCAGGACGGAAATGCTTTGGATGACTATGGTGCAGTTTTTGTAAACTATACATTTAAGAGTATCAAATCCTGCTCGTCCAAAGATGAAACCGCTGACGA 

CAATGCTGCTGCTGCTGATGATGATGGTTCTACTACTTCTACCTCTTCTTCAACTGATACTGATGCTGCTGCAATCCAAGACTTTATGCATGTAATGATT 

AAGAAAATTGATGCCATGAAAGACATTAGAGGAAAGTACAAGAAATCTTTGGCAAAGAAAACAAAGAAGCATTAAATGTATACCTTTATTTTTACCAATA 

AAAATTATATGTTTTACATGTTGTGTTTTATTCAAATTCATATGGTTCATCCCCATTTAAATCGTCTAACCCCATATCCCAGTCAAACGCTGCTGCTGCT 

GCTGCTGGTTCAATGTCTGAGCCATAGATGGATTCATTGACATCTAAAAGTTCATCTCTAAAATTTCCTGGTGTACGCGGTTGACGAATAATTGGTGCAA 

TAAATGCCTCCTCTTCTTCTTCTTCTTCTCCTCTGACTCCCGTTTCCTTGACAGCTTGACGTTGCGCTTTTATTATCTCTTCAACGTTCCTCCCTGGAGT 

AGCGGGAGGAAGATCAAAATCAATACTGCCAAGCTCAGAGTCTATATCAGTAAAGGACATATCAGTAGCGTTTTCTTCTTCTGCTCTTGCTGTAGACGGA 

GGAGGAAGAGGAGTAGGCTTGGGTAGAACTGGTGTAGGTGGAGGTTGAGGGGTAGTAGAACGAGAGTCAAATCTTACACGGGAAATAGGTGGAGAAGGAG 

CAGTATTAGGTATAGGAGGAGGAGGAGGGGGACTAGAAATGAGTTGAACCGGAAAGGGCTGCTGCTGCTGCTGCTGGGGAGGAGGAATATTACTGGGTGA 

TTGTTGTTGAGGAGTAGGGGGAGGAGGAATATTATTGGGCGGCTGTGGTGGTTGCTGCTGCTGCTGCTGCTGCTGCAGCTGAGGCGGCGGCTGTGATTGG 

TTGCCTAAAAAGTACAGCGGGTACTGTTGTTGTAATTGTTGAGGAGCATATGGGTTATAATACTGGTTATATTGATAATAAGGATTTAAGTATCCTTGGT 

AAGTAGGTTGTTGAACCTGGAAAGGTGCCTGTGGTGGAGGAAGAGGAAAGGGAGGAGGTTGGAAAGGGGGAAATTGTTGTTGTTGTTGCTGACGTTGCTG 

TTGTTGTTGTTTGGGCACAAACACTCCAGTCGATAAATTAAATGCTTGACTTGTTTCCTCCACTGCACTCTTTATCATTTTTTCAACCGGTCTTGCTGAA 

GCTCTTGCTAAAACACCAGCTGCACATTTCTCCAGCATGGAGATAGAATGCTCTGATTTACTCCTCATCCCTCCTCCAACCAAAAGATCCATAGTCGATG 

ACATTTTTACCCACACGTTATTATTTATAGCTCTAAAATAACGAGGTATTGATAGGCCACGTACAGTGGCGTTAGAGTTATCAGAAGCTACAGATACAAA 

ATTCTTATCAAAGTTCTTGTTCGGTAATATACGTGTACCAAACAGGTTATTACACACAGTAGCAAGTTGTTCAAAGTATACTCGTTCTTTCACAGTTCGC 

TGTATACGATCCTCCACGATACTGCTCATTAAATTGTTTATCTCTGTTGCATTTCCATCGTTAAGAGATTTCAATATTTTTGAACTTAATTCTTGCCACT

GAATCTCTCCTGACCGTATATCTTTCCTATG<5GAAATTGCCCAACAACTATTA 

AGTGTTTGTTCTTTTTCCTCCGAACTTTTTCTTCCCTTGTTCAGATGACAATAAAGATGATGAAGTAGGAAAAGCCTTGGTTTTATTGGCCTTTAAAAGG 
TTAGCTAAAAGAGACCTTATTCTGTACTGTATAGCCTTCTCCTTGACCACTGCCTCGTCTGCCAAGTATGACACGTAATCGTATTGTGGCGCATTTTCAC 
TCCTTACACCTAAAATAGAGCCCGAAGTTATAGCACCTCTCAATGGTCGAGACACGGTCATCATTTTTCCAGCAGGCACTGCTAATGCCTTTGATTCTGT 
GTCACTTCTCTTTCTTTTTTTAGAGGGAGGAAATATGTGCTTTTTCATTTGTTTTGCCCGCTTCGATACAGATGTAGTAGGTGTAACATGTTCGTCTTCA 
ACATTCTCCTCTTCTTCCTCCTCTCCATCTTTTTCTTCTTTCTCTTCACTCTGTTCTTCCTTTTCTTCCCGCTGCTGTTCCTCCCCTTCCCCTTCCTCTT 
TCCCTTCCTCTTCTTCTTCATTACCTCCTCCTCTTCCACCACCTTCTTGATCAGTGTCTCTTTTTCCCTCATCATTTTCTAAATCTGAACAATCGCCAGC 
AACACAATCGCCACTCCTTGACAGAATTTTACTAGTACGAATCTTACGCCTTTGCCTATTCTTACTAGAAGTTGGCGCGTCCAATTTAGAGACCTCCTGA 
CCCATGTTTCGTTGGAATCTGGATTCGGCGTCATCCTCCTTTTTTCGTTGTTGGTTCCTAATTATATGCTTTTTTAAAGCAGTATTAGGTTTATGTTTAG 
GTGGCATATTTGTAAGGTGGGATAAA<^TTACAAGAAAACAAAAATATTCAAGACATTATTTTATTTTGTAGGCACCGCTGTTTTAGTATGTACATCAAA 



FIGURE 2, Sheet 1 1 







WO 01/38351 PCT/US00/28888 

13/201 



TTCAAAACCATCTTCCATTACATATATCTCAATCTCTTCTTCTTCTCCTCCTCCTCTTTGTTTGGACAAATAACTATCACTACATTTTCTTAAAGCTAGA 
CAACATAGAGTACAATTATACACCTTTTCTAAAGCAGCCTTTGTAGACTTACAATCAGAATACAATAGTTTATGGTCAATTAGTATATGGTCAGGGTTAG 
TTATTTCATCTTCACATATCCAACAAAAAACTGTAGGGTACCCATCTCTTCTACCGCTACAATCATAACAAGTCATGGTAATACATCCACCTAATTTTAT 
GTTAGGTGTCTTACATTTAGGGCATTTTACAGTCTTAATTTCCTGTCCTTCTTTCTCAACCTTCCATTCTAATGTCCTCTTTTCATCCACAGTGAACACT
GGATCCTTTATTGATTCAGATGTCATGTTTTGACGTACAATATTTACCTGTTTAACAATTTCCATTTTTGCACCTTGGTTAGGTCTGATACCCGATAAGG 
CACGCCTATCTACACAATGAAGGACAAAAGAATGGGCTCTATGGAACACACATTCTGGCCGCTCCAACACCATTTCCCCTATAAGATTCTTGACAAACAT 
TTTCGCCCATAATCCTAATGCACTCTCATGGTTCAATTTTTCATAGACATAAAAATAGGTTGCAATTAGAGGTATACTCACACCTACAACTTGrTCAAAA 
TCTCTATCATCAAAACCCCCAAAAACACTGTTTAAGTAATGGACTTTAACCCTATTCTGAAACCTCTCCCTGGTGTTGTGGATTACAGATATGTCGTTGA 
GAAGCCCCTTCATATTCTCTGGCATACGATCAGTCGAGTGCCAGTTGGCTAGCGCTAGAAGAACCTTGCCCTCCGGAGACTCTGGGTTCCATCCTCGAAT 
CTTCTTTTCGGGAGTTCTTTTTAGTATAGAAACAGCCTGCCTACAATTTGTCACCTCTTCATTTGGAGGTACAAAATTTGACGCCAAGAAACCGCAGGAC 
AATCTCATGGGCCATTTATGGAGTCCGGGACCAGTATCATATCTTACTAGCTCATCGGTGGGAGGGTCATCAAATTCAGCATCTTCTACATCCTGGAAGT
AATAGTCTTCCAGTTGTTCGGGAGGAGGGTACGGCAATACTGGAGGAGGTACATCCACTGTTACAATGTCTTCCATCTCATTAGGCTGGTCACATACATT 
GGGTAGTAAACACTGGGTACAGATCAGGGAACATTrGCTATCACCAGTTTCTCCCTCCACCATCTTAAAGAGTTTAACGGGCGGTCTGAATCTATTATCC 
ACCACAGTAAATTTTGAATCCTTGTCTAGTGCAAAGTCCTTGTCCATCTTACACATCTCCATGAATTCGGCCTGAAGAAGTTCCACAGTATAGAAGCATT 
TCTCTGCTCGCZTTGATGAGTTCGGTGATGAGAAGTrTGTTGGCTGCAATGTATTCTGGGTCGACATTTTCTCCTGCTAAAATCGCCCTCTCGAATAAAGA 
TGAAAATTCGTGACATCTACCTATAAACTGTACCATATATTCTCCACAACATGGGCACCTAAAAACGTCCTTTAGTTTTTCACTACCAGCCGAGACCTTT 
TCGTACACATACATTAGGTGTTTTATAGTACAATCAGGGCATAGTTTTGCAGAACACGTTTCTTGTCGTGCTTCAAAAGTGTCCATGAAAGCATGTACGG 
GGAAATAATCGTAAGAGGCTTTCATGTTTTCATGAGAACAGTTGAAGAATTTTGGTTTCATGTCCAAATCCGCGTAGCACAACCCACATGTTTTATTGGT 
GGTATTCCCACTCAATATCTTTGTAACGTGTGCTTGTATTACCTCCACTTCGTCAGATGGTTGGTACACCATCGCCGCTTGACTTTCTCCCCAAGACTCT 
TCTTGCAGTGTCTGGCTAACCATGCATTTGGCGATTGCTTCTTCTAGATGCATATTTTCAGCGTTTTCATGTTCGAGGATGAAAGATCCTAGTTTTCTGG 
CTGAAGGTATGCCGTGTCTAGATACCAGGGACAGTGGTGTATTTTCTAAGAGCGAAGCGTTCGGGGAGATTATTAGACCAGAAGGCGACATTACGGATTT 
CTTTGGCTCCTCTACTCTTTCAGTATCTGCTG<:CAAAAGATAGGGAACATTCTAGATAGAGTATCTGTACACTATATCAATCATCTTGGCTATTTTTGTT 
CCAAAAAGAATATGGTGTAATTCTTGGGGCACATArTCCATAGCGTCATACACGGGCCTGGCAGCGTACATGCGAAGATGACCCTTATTGTAGTGGTTGT 
TGGTGGATGGACCACTTAAAAATGCTTCTGTTACGGAGAAGGCCATATTTTCCOIAATCCTGTCATTGTTTCTTCCTTCATGCATATCAGAACCAAATAA 
AACACTAAACGTTCTAGCGTCTCTTATTTTTGTCCCCACTTGCAAAGCAGAGAAGGGCCACATGTACCACTCCTTATGAAACACATCCTTCAAAGTGTTG 
AGAGAAACTACCACATCTTCATAAATGACACTTTTCATGTGGCGGTGCTTCTT<3TTCTGATCTCG 

AATCCATATCTGATATCATCTCGGCGTACGCCTCACACAACGCATTCTTCTCCTCCACTGCATCGCCTGCATTTCCACGTATAATTTCATCCAACATCAA
GCGCGTATTTTTCATGACAAACATGCCACTATAAATACGATGAATTAGATCCTTGTATGCTGGCCCATCTTGTTTTTTGCAACTCTTTAAATGTCCAATG 
AATTCTTGTTCGTTTTGGTCAAGAGTTGTAAATGGAACTGGGTCGTCGTTACTTTTCATCTTGATGTTCATGTCGAGGGCTTTCTTCATTAATAGAGATA 
TCTCTTTCAAAAGACCGGTTGGATTAATCACAACTGTTTCTCCCTCCAAAAGTTCATCCCTATAAGAATTGCGGTTGCAGATGTCCAGTATGGCCTCCAG 
AAGATCACCACTTTCTCCCACTATACCCAC^TTCGTrCATAGGTGGACCTCTTATTATTTCTGCCTCCCATTAGGTCGGTGATCAAGATTGCAACTTCT 
GGGACAGCAGCT<^AACTATGGATGTTACACCTTCCArGTTGAACGAGGGAATGAATGCGATAAAGTCTGGGGCCTCCATCTTATATACCCTTCAGATGT 
GAAAAAAAACAAC^TGAGGAAAAAAGTAAAAAATTCAATATACATGTATGTCTTTATTGGC^ 

GAACATAAAAGAGCAGCAGCAGGATCTGGAACAACAATAGAGGACTTCTTCTCCTCTTCTTCTTCTCGGACAGGAGCAGAAAAAGCTGCAACAGGAGAGG 
GAGCTTCCTTCTTGACTTCTTGGAGTACATTCTTGTCGACTGTGAACTGGACAGCTCTAGCTGCGCCCTTTGTGCGTAGATAATAGAGAGTCTTGATCCC 
CTTCTCCCATGCATACATAGTCATTGACCGCACCTTGCTGAGTTCGGGTTCTTCCACAAACAAGTTGAGGGATTGAGCTTGGTCAACAAACATACCTCTC 
TGAATAGCCATGTCCAAAGTAGTACGAGGATTAATTTCCCATACAGTTTTGAATAGTTCCTTGGTCGATTTAGGGATATTAGGAAGCGTCTGAATAGATC 
CACCACTTGCCATAATCCTCTGTTTAGTTACTGAATTCCATTCTCCCAGTTTTATGAGCTCTCTAATCACATATTCGTTCACCACTTGGAATGATCCTGA 
AAGTACATTACGATTATACATGTTGGACGTTAAAGGCTCAAAGGATTCAGAGTTGCCCAGGATCTGTGCAGTGGATGCAGTAGGCATGGGAGCAACAAAC 
ATTGAATTGTGAACACCATACTTCATAATGTCCCTTCTCAATTGCTCCCAATCGTGAATTGGCAAAGAGTTGAAATATATGTCCCTATTCTTAATATTTT 
CCTTCCCCATGTCAAATTGGAAAATCCCTTTCCTCAAAGGACTACCCTCAAACAGCTCATATGTTTCTCCCTTTTCTTTGGCAATTTCACATGAAGCTTC 
CAAGGCACCATAGTATATAGTTTCAAAAATCCTCTTGTTAATTAGTGCCGCTTCTTCAGATTCGAAGGGGATTCTGAGTTTGAAGAACAAATCTGCTAGT 
CCCTGCACACCCAATCCCATTGGCCTAGTTTTCATATTGGAAATGCGGGTCTTXn , CAACTGCATAGAAATTGACATCAATCACCTTGTCGAGATTTCTGG 
TCATGATTTTTACAACCCTCTTCATCTCCCGGTAATCAACATAGGGCCTTAGGGAAGGGATGGGAGAATACTTCACAAACTTGTTGACTGCGATAGAAGC 
CAAATTGCACACTGCAGTTTCCTCCGAATCACTGTACTGGACAATTTCAGTGCACAAATTTGAAGACTTGATGATGCCGACATTTTCTTGGTTAGACTTT 
CTATTGATGGTATCCTTAAAGCACACATAAGGTGTTCCAGTTTCGATACGTGCAGAATTAATTTGGTCGAATAATGCACGTGCCTTCACCACCTCTTTAC 
CTTTGCCTTCAGCCTCATATTTCTCGTACAACGCCrTAAACTCTTCGCCATGGACGTCGGAAAGGCCAGGGCACTCGTGGGGGCACATCAGGGACCAATT 
CTTCCCAGCCTTCACTCTCTCCATGAAGAGATCAGATACCCAGATAGCTGGGAAAAGATCCCTCGTCCTCAAATCTTCATTACCGGCATTCTTTCTGCAG 
TCAATAAAGTCCTTCACGTCCAGATGCCAATCAGAAATATAGATGGCTGCAGCTCCTCTCCTCTTGTCTCCTCCCTGGCTAACCTTTTTCACAGAGACGT 
TAAAGATTTGGAGGAACGCCATGAGACCAGGGTGGGTACCACTCCATGACGAAATGGGGCTTCCTTTTGCCCTCAAATCATGAAAGTGGATGCCGAGTCC 
TCCAGCAGTCTTGGAGATGATTGCCGCCTCCTTAAGAGTATCATAAATACCCTCAATGCTATCATCTTGAAGGCCCAGAAGGAAGCACGAGGAAAGTTGG 
GGTGTGACTGTTCCACAATTAAACAGTGTGGGGGAAGCATGGGTAAAATAGTGCCTCGACATGAGATCATATGTTTCAATGACAGATTTGATGTCTGATC 
CGTGAATGCCGACAGCCACACGCATAATCATCTCCTGAGGGCGCTCAACCAAGATTCTCTTCTTTCTATCAGTGGGGGAACCAATCTTGATCAAATAAGA 
GTATTCTAGCGTCCTCAGTCCGAAACAGGTGAAGAGATAATCCATTTTGTAATCGATAACAGCATCGAGAATTTCAGCATTGGCCATAACATTTTCATAG 
TAGGTATCATTAACTACTGATGCTGGTTTACCAGTTCCAGGGTGGATTGCGTGCCTCAATTTCTGAGTTGTTGCACTAAAACTATTCCACTCTTTGGTGT 
TTTTGTGGATGTTCGAACAGATGAATCTTCCTGCCAGTTTTCCAAAATCAGGGTGGTCAACAATTTTTGTCTTTGCATAATCGGCCAGAAAATCGTCCAT 
TTCTTGGAAGGAGATGGTAGCGGGAAGACGGTCCATGATGTGAGATGCAAGTTCTTGAGGGTTAATTGCGTTCTTGTCAAGCTTGGGCACATACTGGTTG 
ACTGGAAGACAGGCATTTTCAATCCTCTTGATTATCTTTTCAAGACTAATTTCTTGCTTAGTGCCATTCCTCTTTGAGATGAATGATTGTTGCTGGTTAG 
AACCCATTTTATGTTATTAAATTGGGATGTTGTTATTTGGAGTATTTTCCTATACGTCTTCTCGGCTCGAGGAGGGTGAAAGATGTGAGTGAAGAGGCTT 
GGACGGTAGCTGTATTTATACCGGCCCGGTGAAGXSTCGCTGGATATTTTTGAAGGAGGGGAAGAAATATATAGTATTATACGTAGTTCTTTGATGTAGAA 
AGAAATCAGAAGGTGAGGTGACAGCTGCTGCTGCTGCAGTATTAGTAGTGCAGGTGCAAAACTGTTCCTCAAAAACTTTTTAAAAATTTTTCTGGATCAC 
TCGAGTTTAGGGGGTGGACCGCTGTCTAGGCCTGATGTCAAGTTCCGAGGGTGGACCGCTGGGTCGACCCAATGTCAGATTGCACCAGAACCGACTGTTG 
CTCCAGAAATGTGCATTAACCTTTGTGTGTTTCTGGAACAGACGTTTCTGGGCATAGCTGCGCCCAGAAAGGGAAGGTGCAACATACTGTTTTCGCTATA 
GAATTTTACTCTTCCTCACTCTCACAAAATTTTGAAAAATTTTTCTGGGTCACTCCAGTTT^ 

AGGGTGGACCGCTGGGTCGGGCCAATGTCAGATTGCACCAGAAACGACTGTTGCTCCAGAAACGTACATTAACCTTTGCGTGTTTCTGGAACAGGCGTTT
CTGGGCATAGCTGCGCCCAGAAAGGGAAGGCTGTATCTTACTGTTTTCGCTAGAGAATTTTACTCTTTCTCCCTCTCACAAAATTTTGAAAAATTTTTCT 
GGGTCACTCCAGTTTAGGGGGTGGACCGCTGGGTCGACCCAATGTCAAGTTCGGAGGGTGGACCGCTGGGTCG^GCCAATGTCAGATTGCACCAGAAACG 
GAATAGCTCCAGAAATGTGCATTAACTTTTGTGTGTTTCTGGACAAGTCGTTTCTGGTCTAGGAGGCAACTAGAAATATAAGCTTGTAGCCTAGAATrTA 
GTACACCAAGGATAAAGAAATATAACAAATACCATACTATGTTTTATTTGAGTAATATAGTATCCAAGTTAAAATAACAATCACGCACTTGTACATTAAT 
AGTCTGTAAAAGATGGGCAAACGTACACCTGAGAGTATTGTCCCGACAATCGTACATGAACCCTCTAGTGCACGTATCACGTTTTCGCCCTATTACCATT 
GATGCACAAATTTCCTCCTTCATTCTAATAGAGGCAGATTGTTTGTCAAAATAACACTCCCTATAGTAACAACCAGGATTTCCATTGGATTGACACTGTA 
GTTGTTCGAACGGTTTTCCTGTATAACATTTCCCAGCTGCGTGGTAGTGACGCACAAAACTCATGCAAGAATGTGCGTGGGTAGTTTCATTTCTAAGAAT 
CTCCCACGCACTAGAGGCACAGAAAAAATCAACTGCTGCGTGGCAGTGACGCACGGCCACACACCCGCATTCTTCACACCATAAAAGGACATGATTCGTG 
TTAGTCGTCACATCTCTCAGAAACCCCCATGTGCATTGACGTCATGGGGAATTGCTATTGCACCACCTTATGGTGTATAAAAGTGGCCCTGGGTGGCATC 
GGTTAGTAGACAGAAACAAACCGTCAAGATGGTGTCGTCTATTACCCACCTCTCTCTGTTGTTCGTCGTGGCTGTAGTAGCTTCCGTCGTTTTTACAACT 
GAAGGAGCTAGTGTGAGAGTGAAACGGTGTGCTCTTAGCCCGTGCCCCGACGTTATTGACCCCGACCACCGCTGCCAAGGGCGACTGTGCCGCAGGTCTA 
CTCGAGGAGGTGACGACGACGACGACGATGACGATGGAGGAACTTTCGATACAGTAGGGTCTGGTATACTTGGACGCAAAAAGCGTGCCGCACCTCCACC 
TGAGGATGAAGAAGAGGATGATTTCTACCGCAAAAAGCGTGCCGCACCTCCACCTGAGGATGAAGAAGAGGATGATTTCTACCGCAAAAAGCGTGCCGCA 
CCTCCACCTGAGGATGAAGAAGAGGATGAGTTCTACCGCAAAAAGCGTGCCGCACCTCCACCTGAGGATGAAGAAGAGGATGAGTTCTACCGCAAAAAGC
GTGCCGCACCTCCACCTGAGGATGAAGAAGAGGATGAGTTCTACCGCAAAAAGCGTGCCGCACCTCCACCTGAGGATGAAGAAGAGGATGAGTTCTACCG 
CAAAAAGCGTGCCGCACCTCCACCTGAGGATGAAGAAGAGGATGAGTTCTACCGCAAAAAGCGTGCCGCACCTCCACCTGAGGATGAAGAAGAGGATGAG 
TTCTACCGCAAAAAGCGTGCCGCACCTCCACCTGAGGATGAAGAAGAGGATGATTTCTACCGCAAAAAGCGTGCCGCACCTCCACCTGAGGATGAAGAAG 
AGGATGATTTCTACCGCAAAAAGCGTGCCGCACCTCCACCTGAGGATGAAGAAGAGGATGATTTCTACCGCAAAAAGCGTGCCGCACCTCCACCTGAGGA 
TGAAGAAGAGGATGATTTCTACCGCAAAAAGCGTTAAACTACGCACGAAAGTGACGGTGGTTGAAGAATAGACT 

TTTTCATGAAATGTGTACACACCTGCTATATATACGTGCATATTTGAATAAGGAATAAAGTTTATCTGCATCTGTATTTTTTTATTAGTATCCTTTTTGT

GGGATAAATAATCGGTATTTGTGTCCAAAGATAGTGGTGCCATGATGCCATATACAAAGCACGTGATACTTTCCAAGAAGCAGAGTCGTATGACCATTGA 

CGTCAATGCGTGGGCAGAGGCGGAGGTGGTGATAAAGCGTTTCTGAGAAACATTGGGCGTATGACGTCAACTACATTATTCTTCCTCCTCCTCCTCCTAT 

TGCCTCTGCCAG1TCAGTATTTTATTTTTCTTCCTATACAATAAAAAGTATCAGATGAATATTTTTACTGTTTTTCTGTTCATCCCTCTTTCCTATTG 

AAAAAACCAATAACTAACTAATCATGGATAACITGAAAGGGGAATTTGTTGCGCTTAAAAC^GACCTCACCCATTACAAAACACAGTTGGATAGATCTAT 

ATTGGTATTTGTTGATGTTGTTGGTAGATTATATGTTATAGTAAATAGTGAACAAACAGCTAAAAAGGAAGGTCTAGCAACTAGAGTGGCAAAGCAAGCC 

ACAGAGATACAACAATTCAAGGACGAAATAAACAACAAATATAATGCTCTAACAAATACTTTGGATGATATCATCTACATTTTTGATCATGGAGGGAGTT 

TCAAAAGAGCAAAACATAAGGCCATAATTGAAGCGAGGGAATACTCTAAACCGCTGAGGGAATTAGAGTGCATGTTTACGCGTATAGCGGACATGTTAAC 

CTTGACTTTTATGACTGTGTACACCAATATCATTACTGAATTTAGACACTCTAGTGAACAAGCCACTAATAGTATAAATGTCACCCTCGGACGTCTTT^ 

TTGTGTGACGACTTGTGCAATCAATTACCAAAAGAAGAGGAAGAAGAGGAAGATTTGAAACAGAAATTCATTACTTTCCATGCGAACCTATACATGCTGG 

ACACACGCCTAAAGAAAGATTTGATAATTTTCAAAGATGTCATACAACAACTTCACGTGATTTTGCAAAAGGATACCTATGCTGTAAAAGAAGGTGTGG 

CATTAGATGTGCGAAACAGATGAACGAAATAAGTCAATACAG^GACTIACCTCJ^GGATAATTACAATACATTTTCAAACATTTTGAATGAAATTGTCTAC 

ATTTTTGATCACGGGGGACATTTTGAAGAAGTAAAACACAAAGCCATAACTCTGACTAGAAATTACTTGAAAACACTCATGGGATTAAAATGCATGTTCA

AACGCATATCCGAAATGTTGTCATTGACTTTTCTAACAGTGTACACTAATGTTATAGCAGAATTTATAAACGCTAGCAATATTTCTGATAGAGAGATCAA 

TAATTATCTTGTCCAACTTGTAACATGTAACGAATTGTGCAACCAACTCCCCAAACCTAAACAATACCGTCCCCTCAGTTTGATAGATAACATAGCTTAT

TTTTCTCTTTCTGTCCAAAAACATCTGAGTGGGTTTCTTTAGTATGTGGAGTATTATTGTCTTGAAGCTCATAAGTATTCAACCTCTCCTACTGGTAACA 

TCTCTTCCTCTCTATAATCCCAATATGGATAGTTGTTGCCTGATATCGAGGATAACACCAGAACTGGCTGGCAAGTTAACCTGGATCTTCATACCAGAAA 

ACAACTTCAAGATTGTCCAGAACTCACTCCCAGACGACCAAGTTATCTCCCAATTCAGATATTTCGACCATAGACATTGCTATACGTTTATGGAGATTTT 

GATGGCAAACATTAAAATCCAAGACAGGAAACAAAACACCACAGCCATATGTGAATTGACAACTGGA^ 

GTATTTTTGGGTTCAGAGGAAAAACGAGAAGAGTTATTGGGGAATCTCCCTGAAGGTGCAGAAATTTTCAGGCCTAGAGAAGTTATGCAAGTAATTGGTA 
CTCTCTTGGACAAGAAACTAGAAATTGACGACGGTATAGCTTCTGTAAAGGCTGCCCTCTGTGCTGGTTCATCATCGTTATACCTAATCATGAGCCACAT 
AGTGAAAATGACCTTTTCTGCTATCACAAACATGAAGGATATAAACGAAGAATATTTCGTAGACTTTATATTTCGTCATAAACAATTCCTCAACCCTGAA 
TTCTTCAAGCACCTTATATCTTTGCTCAAGAATTCCAGGAAGGAACATGTTGCCCATCTAGTAAGACGTCTAGAACACTTTCTCATGCTATGGACCCTTT 
CCAAGATGAGGTTCACAGAAATGGAAGAAAACTACTTCCCAATCTCCAGCGATAGTGATTACGGCATCTGTGAAAAATGTGCACGAAAAACTCCCAAATA 
CAAGCTCCGTATTTTTAGGGAACGAAAATGCTC<:GATAGATGTTGCCGTCTTTAT^^ 

ACCCAACAATCC^ATAAAGGCTACATTAATGCAGGCGATGAAATTATCGGCATGCTAAACTCAAATGATAAGGGAAAAACATTCCCTCCTATACCTAAGA 

TGGTTGTACGAAGAGTGGTGGACGGTGTCTACGGGCAAGGAACTATCCTGTCAAAGATTTTGAAGTTCAGGCAGGCAAATATCCCCACGTGTCTATTCGT 

GACATGCAATAAATGCAATAGGATTTTCAGGCTCACTATCTTAGGGCCTACAAGAAACATCCTTTGCCCACCTTGCAGAAAGAAAAGTGTTGCAGTAAAT 

ACACAACAGAAAG^GAAAATAAACCTTCGTTTGTGCAAAAAGGAACAAAACCTCTACGAGTGGATACCGGTAGCAACAAGAACACG^ 

GTTCCTGGGAAAGATTCAATACTGAAGTTTTGCTCCCTTGGCTTGGCTACACTATTGAGTCTAAGTGGCAGAACTGGGAATCTTTTCTGGGTTATTCGAG 

TACCAGATATAAGGAACTGTGGGCCTTTGTGAACAAACAGGAAATATCTTCCATGAAAGACTCCTACATAAAAATTGAAGACATCGACCAGTTATTGAGG 

AGTATCTTGCAAGACCAGAAGGGTGTATTTGAGACCGTCTGCAAAATAAAGAGC-AGAGATGGTTTGTGAATTGGCCACACTGATTCCGATACCGACTAAA 

AAGGCTGATTGATG^CAACCCCCCCCCCTCCAGACTCAGCCGCATGAGTATAAATATGGCCACTTCTCACACCACAGCATCATTCCCTCGTCATCGGTCC 

TACCGTCAACTTCCATTATTACTCCAATAATACCAACAACCCCAGAAATGGAGTCAATCAAACTGTTCACCGT^GCTGGTCTGAATATGGAGCAAGCCAA 

CCAAGTGGCTGAAGAAATCAAGTCAGAATATAAAACCGAGGAGGAAAAGAGGATTGCCCAGGAAGTGTTTGACAAATTCACCAAAAAACTCATTATGCAA 

GTAGATACGTCTAAACACTTACTTACAAGAGAAAACCCCAACCGTTTTGTATCCCGCCCCATTGTCCATGAAGATCTCTGGGAAATGTACAAAAAAGAGG 

TTGCCTGTTTTTGGACATTGGAAGAGATTGATTTCGAAAGGGATCCTAAAGATTGGGAGAAACTCACTCAAGATGAGAAGGATTTCATTCTCCAGATTCT 

GGCGTTCTTTGCATCCTCTGACGGAATTGTAATTGAAAATCTTACAACACGTCTTCGTCAAGTGGCGCAGATTCCAGAAGCGAGGAGTTTCTTTGACTTC 

CAAGTTGGAATGGAGAGTATTCATGGCAACGTCTACGGAGAACTGATTGATAGACTGGTGCCCGACGAAAAAGACAAGGCTATCTTGTTTAACGCTGC^ 

AACACTTCCCCGCCATCAAGAAGAAGGAGCAGTGGGCTATTAATTGGATGC^AAGCAATAACGATTTGGCGGAACTAATTGTTGCCTTTGCTGCAGTTGA 

AGGAATCTTCTTTAGTGGTGCATTCGCATCCATTTTCTGGATCAAGAACAGGGGTATTTTGCCTGGTCTCACCTCCTCCAATGAGTTCATTTCTAGGGAC 

GAAGGTCTTCATCGCGACTTTGCATGCATGCTGTTGAAAAAGGGTTTTGTTGATACCCCATCAAGAGAAAGGATTCTTGAAATTGTCACTGAAGCCGTCC 

GAATTGAACAAGAATTTCTCACAGTTTCCCTGCCTGTTAAATTAGTGGGAATGAACTGCAAGTTGATGAGCCAGTACATTGAATTTGTGGCAGATAAACT 

ATTGGTTGAAATGGGACTAGAAAAGCACTATAATGTTACCAACCCCTTCCCATTCATGGACAATATTTCCCTCGAGAATAAGACCAACTTTTTTGAAAAG 

AGAGTCGCCGAGTATCAACGTGCCCAGGTCATGGCTTCTATGAATAAGATCAAGAAGGACC^ 

CTGCACCTCCTCCAGTCTCTTCCTCATCATCCGAACAAGAAGATGTTGAAGACGGCGTCGGGGACTACATCAGTTATGACGATTTTTAGTTCCACTATTG
TGTCAATAGGTTGTGTATTGTATTATTATTGTTATAATATTTTTAAAAAATAAATGT^ 

AACCTTTTTTGTTGCGGTAGATGCATGCTCTTGCTCTACCATCTGCCTTTTACCTGATGGGAAGAAACAACCCTTGGTTTTTGATTCTGTATTAGAAGAG 
GTGGTATACCCTACAGATGTGTGTGGGCCAAAGGGAGCTGGCGAATTATTCACTGGTGTGGATCTTT^ 

GTGAATGGTCAGGAAAAGGTCCTTGTCCAAGGATCAATAACGCTGTCGTTGAACGAGATTACTCCCTTGACGAGGAGGATTGTAAAGGGTTTAGAAAGGG 
GTTCCGAATTCCTGGCACTGACCATTTTCATACTGTCTTTTCCCTTTGTTGGGTAGACAGAGATATGCACGCCAAGTGGGTGCGCAACAAAATAAACCCT 
GGTATAGTAACTGATGATGAAGATTTGGTAGATTCTGGTATTAGGACTAAATTTAAATACTCTTCTAAAATTTTTGGTAAAGGATTCAATCCGAGACCTC 
TTTACTCCCTCGACTATCAAGAGAGGATTAAGATATTAAAGTCTCATTTTAACMGAGGACGGGTAATT^ 

AGATTTTTTCCTCGCTTCAGAGAGATGGGCAACTTTTGCTCTAGAGAATGCAGTACCTCAAATACAGAACCATAACAATGGTGAATGGAAAGATATTGAA 
AATCGTGCAAGAACTACGCCAGGTGCCGCGTGGGCTGAGACTGGACCAATATTTTACCAACACAAGAAGAAGGAATATCTAGACAAGAAGAAGAAGTACA 
TCCCTATCCCTCATGCCCTCTACAAGATTGTGTACGACAAGAATAACAAGGAATTGTTCCGTGTACAGAGTGATATGTCTTGGAAATAAAATACATAATT 
AAAGAATTTATATTGTTTTATTTTGTCATTTATTTGATACATTTGTTGTATGT^ 

TATGTCTGGAGCTCTGATGAATGTCTTGCCAGATGCCATCCTAGTTTTTCTCCTGGGGAAGGTGACGTCCGTTTGAGAGAGGGTTCGCTGGGTTTCAACT 

GCAGCCAAGAAAATATCCCTTTGTCCAGTGACGGTAAATTGTGTACCGTTATTTAGTGTGATGACATTATCCCTCTCTGATTGTTTCAAAGCTTCATAAT 

CTTCCTCCTCATCTTCATAGTCATCTGCAACATTCTCAGTGTCTAATAGAACAATAACAGAGTTGGGGGAAAAATCACTCGTGCTACGTCCAGACTGGCC 

AGTATTTTTACTGCTCCATGGTGTAGX3ATCATGCTTTCTAGGCTTGTCTGAATGTAGACCATTATAGTTGGCTCTCCCCATCATATCATACAGTGTACAC 

ACTAGTGTATTACCAATGCGTTTTCCACTCTTATTGTTATCACAATAATCCTCATCTTCTCCAACAAGAGTTGAGTACACACTGGAATCTGTTAGAATTG 

TATGAGAAACTAATGCCTCTTTAAAAATATCGTCACTCTCTTCTAACCAGAAATCTGATACACTATCATTACATTTATTTGCAGTATTATCAATAAATGA 

ATTGAGTAAAGGGTCTAAAGGTTCCACGCCAAATAAAGGTATAGGCACAAACTTTGCCTCATCCGTGCGAGGTAAAAATGGCCTGTCTTCGTTGTATGAT 

TTTGAAAAGGTTGTTGCAAGAACACAATTTCrTTGCATTTTTTAATGCCTCAGCTGATATACGTAGACGTATACTGTTTAACTTTTGCTCTGTTGTCT 

TATTAAATCCTCCACACAACGAGTACATGGCATTATCGATATTCTCCCTCGCATGATCCTCTATGACCAAACCAGCCTTGGTCATTAGAAAATGGTCTGG 

GTCTTTACCGAGAGGAAGAATATCTGCATTGAATTTTGAAATTATATCATTGACAACACGTTCACAGTGCACAGATCTCTTTATGAATTTAGAGGCTAGA 

GATATGAAATGTCTATTTAAATTGGAACCTAGCCTATTCCAAGGAAGTGAATTTTGATGTCTTTTATTATCCCTCTCGTCAGAAGGGAACAAAAATTGAA

GTAATCTGGTCCAATTAACCGCAGCTGCAGTTTCTCTCTGGACGAATGTTTGACGTTGTTTTAGTAGCTGAAGAGACACTTGGGTTGGCTTGTCACTACT 

AGCTGGAAGAGGGTGTTGAAAGCCACAGACGCCAAATCTGAACGTCATCTTTAGATTGGTGTGTAGAGATTTTGTGTATTCCTCTTCGTGGATTTTTCTT 

CTCCTTTC^TTGGGCACAAACTCCTTTGTCTTTAATTCTTCCAAGAAGTCTTGTACATTGTCCTCATTCGCGAACGTTACCTTCCCATCTTGCCCTCGTT 

TCATCACGTCGTTGTTCTTAAACAGGCGCGCAGAAAGCTGCATCAGGGACATTTCTTCCAACTTTTTAGGGTCTTCATGGATGTCCTTCTTCCTCTTCCC
CGAA^GATAAAAGACCCTATAACATTCTCAATGCAATGAACTGCATACTGTCTCAATTTTCTGTTTGTCATCCCTGAACCAAAGCCTATTACTGGAACA
ACCATGCCGCACTTGTGGTTGTATGTGAGCCCACAAGCAACTGTCAATTTCTTCAGAAGGATCTCGTATTTTTCACTTCCATTAACATCGTTCGTAAAAC 
TCTGTTTATACTTGTGAGCAACCCACTTCACTATAAATAACAA 

GTCTGCCATGTAGCCATCTGATTCGTGTTCGGCGATAGGTGTCCTCATCATTGTGACGGATGCTTTCTCCACTCTATCTGCTACAAATTTCACTGCTTTA 

ACAATACTTTrTCTGCACGCATTTTTGTCATTAGAGTTTATTCTTGCATATTGGAGTTCACAGGGGTAATATTCTTCCCCGTCATTATCTTCATTGGAGG 

ATGAATTTTTAGTTTTGTTGTCGTTCTTTATATACTGTTTCAATTTTGAGAAGGGAGTATCCAGTACGTTTTCTCTACATGCACTAGCCGCAGAAAGAGT 

GAAACCGAGCAGGTAAAGTTGTACACATGAAAAGGCACCTCCAAAAATATTACCAATGAAATAAGCAAATATTAATCCCCTGATGTGGTTGCTAGGGAAT 

CTAAAGTTGTGCTCCAGACTATAATTCGTTTCAACGTTAAAGTACCTTTCCAATCTACGAGGCACTTTAATGTTTATGTCTGGAAGGATTTCAAAATTCT 

CACPAAACATACCCGACAACTTGTTTGATTTTCCCAATACAATATCTGACCTTATGTATTTTTCATAAAAGTCAAAATCCGTCATCGTTAAATCAGTTGG 

GGGCTTTTTTGCATTTTTAAGAGTATCCGAAAGATGCACCAAACCAGTTCTGTAGTTGTAGTTTGCCATTGCTTCATCGGGCAATAATTTGCTCACTTCA

TTGTATGCATCGACAACTTTTTCTGGAGCAATTTTTTCAGCATATAAACGTTGCCATATTTTCTTATCAGGATCTAGTAGTTCAACGTGTCTAGATTCTA

TTGTCTTCACATCGACAAGAAAGTATCCTTCACGTATACTGGTGCAGGCCAATCTCTCCATCTTCTTGTTGTAGTAGTCTATCCTCTTCCTGAAGCAAGA 

TATTATGGAAGCAACTCCATCCAACATGTGGTACAG<K;CTGATTGGGTACACCACTCGTCACTATTTACTCCATCACGAGCAGTGAAGAAATCTTGAAGG 

AAACTGTATATAGAGATGAACGAAAACAGTCTTCTGGATGATGTTTTCAGTCTCATGTCAGTTCTCTGCATAAGTTTCTCGTGCCGTATAACTCTGTCTA 

CGAATTCCCATGTCAGGTTTTCATTTACGTATTCTTTTAGGAGTGCCTTTTCTGCTTCTATAGTTGTAGTAGAAGTTGAATCCATAATGTAGGAATGAGG 

GAGGTACTTTGGCACGCACGTGTGCACTGACCGAGACACAACGGTGTACTGAACAAGGGTTAAAGACGTTGGCTGTTTTACACCTCTCAGAGGAATCCCT 

CTACCAGAATAAGTCCTTTCATTTTTTTAATTAACACACGCAAAACTGACGCTGAAATCGTTTCGCAATAATGGGAGAGGAAGTGGGAGGAGAACTTTCG 

CCTGTGGTCGTAGGGATACTTTCGGTGGTGTTAACCGCCATAATTATCTCTACTATATTTGCCATCTACTACTTTTATTTCCGTCCAAAGGAAGGGTTAA 

AGAAATCGCGAGTCAGTAAAGACCCCTTGGCTATACGCGTAGATGCTAACCATGTCCTCTCCAATAATTCTCCCTACATGATGTTCAATACAAACATTAA 

GGCTAGAAATACACTAACAGAATTTGATCCCAAAAATATGAATGAAGAGGAACGAGCAGAATTTTTCAAGATCCATTATTTTACTGGGTTCCCTAAAGAT 

GTGGAAAAGGAACAGGTGCAGAAGAAGAAGGAGGAAGAAGAAGAAGACGAAGAGGAACAGACGACAGATACTAACCCCAAAAATAAGAAAGTGGAATATG 

TAATGCTCGAATCTCTGAGAGGGGATAAGGCTGGATTTTTGATAGATGAGAATAAGACCATGTCTATACAAGGTTCTCCTGGTACAAAAACTGTCTGTAA 

GAAAACGACCCTACCAAAGTTTGTCCCAACCCCTAAAACGGAAAAGTATGTTCCTTCAGATGAAAGATTTTTCAGGATGGAAGAAGGAACAGGAGAAAA 

GCAGAACATTATTCCTTCACGTCTGTTAAGACAGGAAGAACCTTAATCATGGGCTCTTGTGGCTACCCTGTGATGAAGGATATAAGCAAAGGTACAAGCC

AAC^GTACATGTACACATTTGAACCAATAAAGGAAAAAAAGTAATGTCTTTGTTTATTTCCCCCTTTTTTAGAAAAATTGGAACAACAACAATGTCGTTG 

GCTGTGACAGAAGATTACGGGCACAATGAAAAGTTGATCAAACGGTTACAAACCrCTGTATATCACACCCCTCTATTAGGTGCAGACCATGTAATGAAAT 

CCATATCAGACTACATAATTTCTCGTCGATTCATGAACTACACAAATTTATTAAAACAAGTTGAATATGTTTTCGATGAAGAAACAGGAGCAGTTATAGC 

TAATATCTCTCTGTTAAAAATCCTAGAAAGATGCGCACAGAAAGGAGGAATATATGATGC^^ 

GTAACGCGCCTATTTACTATTATAGGAGGTAGGCCCAATATGACGGTGCGGGTTAATTTTAAACATGGGCAGACAAATAATCCTGCCTATGGTTATCTCA 
CAGATGATAATGATACTACTACTGTTACTCCTCCTGTTACTCCTCCTCCATCTCCAGCTGCAAGAAGATCCCCTTTTTTCACACGCACTCTCATATCCGA 
GTCGTCTTCAGTTGACCATTATGTATTGATGCATGATAACCCAAAAAGATCTTCATTTAAGGTGTATGATATTCACGCAGAAACCTTTCCCCATAAAGCT 
CCTTCTGTTCCTACCTTCCCCCCTAAAACCTCGTTTGAAATTTCTGACGTGACTCTCGATTGTTCAATGGAGATTTTTTCACGAGACAGGGATGTTTTAG 
ACAATGTTCACGACTATATTGCTAACGACCCCGTACCATTTTTAGTGGATGTTGTGCACCGTGGATCTAGTCTCCGTTGAGTAAGTACTTTAGTTGCAGC 
CGGTCTCATTAGACCTGAAAAATACCCTTCArTTGTTAGAAAAGTGTGCTTAGATCAAAATGGCATCCACTTCTTCATCAACTAAGAAAAGAGTACATGA 
AGAAGATGAAAATCTCATCCCACAACCC^AAAAGAAAAAATCAAAGAAAGTACTACCATTTCCTGTTGACAAGTATAGAGCTGTGGATAAAAAGGTGGTA 
AATCTCATACACAAGATATTAGATCAAGAAAAGGACCACCTTTCTAGTACCGAACTGCAAATGATAACTGAATGTAATGGTGCGCGAGAAGATCTGCTTA 
AACATCTTCTAGACGAAGGAGAATTTAACCCTACTATAATTGAAGTAGTATCATCCATGCCTATTGAAACAATATACGAAATACTCTCTTCTTCTGCTGA 
CGACAAGAAGTTTGTACAGATATCATTATCAATGTTGATCCACATACTTTTCTTCGCTGATAAGGGTACTATGTGGGTATCCAACGCGTGCGTTCAAAAT 
GTTTTGGGGAACGACTATAAAGTGGAATTTGAAAATATACGTAAAAAGTATCTGATATTGGAAGACTTACTGAACGGCGTTTCAAATCATTGGTCTGAAC 
ATGGTCCTCTTTCTCACATGCTCCATTCTTCAATCCCTATTGTACAAGACATGTTATTGAACAGGCTGGTGCGTTACTTTAGCACGTATGATGGAGATGC 
TCAATTCGATATATCATTCATAATTAATAGTGTCTTGTGGGGAATTGATAAAAGTGTTCTCAACGAATTGACACAATTGATATCGAGGGGTGTTTTCATT 
GTGTCGTACGTACCGATGCGTGTACGAACACCTTCAAAGGACAGTAATCGGCCACAAAATACTCCTTCACAAAATATGTCAGCACTAGGTATGAAACTCA 
ATACATTTTCATCCAGAATCTCAGTGTACAGAAACAATACCTTTAAAAAACTAACCGAGTTAGTGCATAACTTTGATTACGGTTCCAAAGATGCATCATC 
ATCATCTCCTCCTCCTCCTTCATTATCGGACAGCGTCAACACTTTTGTGAGGTTGTACACCAACTATGACATATTCTTAAAGGTGATTTCCGACTGGAAA 
ATGCCTTATGGGTTCTTTAAGAAAACTTTTGACGTCCTTTATTCTAAGGGGTTGATGACATTATCAGTGTCTGAATATACACTCAAAAAAGAGTTGGTTA 
CGT/TTTTGCGCGCCTTGAAGGAAAGGGAAATTTTAATCTATAAAATGGAGAAGAGAGACATTATATGTATACTGAAAAAGTCTTTGTTCGGATTTAATTT 
CAGGTGTTTAAAACAATTACTCCCTCTCTTCAAACACTTTTTAAAAATTGAAGAGGTTAAAC^ 

TGCAAAACTCAAAAAGATTTGCAGAGTTTCCCTGCCATACAGTCTGCTTCACTTTTCATGGAAGAATTCCCTTGGCTTGCAAAAACTTGGArCGACGACG 
ATGATGATGAAGGAGGAAAGGGACATACCCTATTAACATTTGCTATAGTGCACAGATATCCCTTAATAAGCCAACTTATTTCACACCCAATTTTAAAATC 
GTTAGTGAATACTACATGTAGAGACAAGCACTTTACTCCCCTCATGCACCTCGCCAACACGTCTATAATGTACCAATGCAATACACTCTTGTGCCTTATA 
ATAAATGGAGCTAAACCAGAGTTCATAAACAAGTTCAACGAGAATGTTTTGCATATAGCGATTGAAAATGTTAACTATGGAGTCATCACTGAATTGAGAG 
GAACATTATCCAGCGAACAAATTGAAAAAATGGTCAATGTAAGAAGAATGATGGATAATACAACACCTTTAATGATCGCCTTGGCGAGGGAGAATATTGT 
ACTCGCTCAGCTTTTTGACGGTCTTTACAAGCCCAAAATAAAGGTCCGTTTCGGTTCTTCAAAGAGGCTAAGGATACCAGAGTTTGTCCTCTTAAAGGGC 
CTAAAGGAATCAGTTGCATATTTGGAAACGAGGAATATATCCTACGATATTAACATCATAAAGGATGCAGTAATGGACAACAGTCTTTTTGAAGAGGAGT 
ACGAAATAGCAGCAGCAGGACTGCGAGGCAATAACTGCGACCCTGAAGCAGACGAGAAGACTATGAACACGTGGAACTTTTTCACCAAAAATTCAACCAA 
ATGGGCAAGCTCTATTTTCCAAAAGAATAGGCAGAAATTTGTAAAGATTGTGGATGGTATGAATAGGACATATGAAGACTCTGAATGTGCAATATGCTTG 
G ATAGTCTG<3AC GGGG ATC TTCCT TCAGGGAG AAC AACGTGC GGTC ATTGC TTC C AC AACGT CTGTTGGTT ATC CTTGATAAGGATG AGCGGGC C G AAT A 
ATGGCAGCCGCGCAAGAGGAGGAGGAATAAAATGCCCGTCCTGCAGACAAGTCACCTGCCTCGGAAAAAGACTAGGGGTTGCCGACTATGATATTGAAAC 
AGAGGAAGMCGTGACACGAAAAATGTCGTGCCTTCGGTAGAAGAAGGAAGAAGGGAATGGAGGAAGATTGGTGTTGACAGATATGAATTTCTTGTAGGT 
GGAGTGTGGACAAATGAAATAAAACTATAAAATGAACTAGAATATTGGTATTATTTTrACACCTTACACATGTTCCAGAAACGTCTGTTCCAGAAATGTG 
TTTTGAGATTTCTGGACAAGCCATTTCTGGAGTAATCTGACATTGGGTCGACCCAGCGGTCCACCCTCGGAACTTGACATTGGGTCGACCGAGCGGTCCA 
CCCTCTAAACTCGAGCGACCCAGAAAAATTTTTAAAAAATTTTTGAGATGGAGAATGAGTAAAATCTCTTCCCTGAAAG 

CGGTTCTAGGTGCCCGCCCTCTCCAGAAACGTCTGTTCCAGAAATGTGTTTTGAGATTTCTGGACACTCCATTTCTGGAGTAATCTGACATTGGGTCGAC 

CCAGCGGTCCACCCTCGGAACTTGACATCAGGCCGACCCAGCGGTCCACCCTCTAAACTCGAGTGACCCAGAAAAATTTTTAAAAATTTTTTGAGATGGA 

GAATGAGTAAAATCTCTTCCCTGAAAGGGAGATCTGAAGAGGCTATACCGGTTCTAGGTGCCCACCCTCTCCAGAAACGTCTGTTCCAGAAACTTACTAA 

AGTTAAAGTACGTTTCTGGACAAGCCATTTCTGGAGTAATCTGACATTGGGTCGACCCAGCGGTCCACCCTCGGAACTTGACATCAGGCCGACCGAGCGG 

TCCACCCTCTAAACTCGAGCGACACAGAAAAATTTTTAAAAAATTTTTGAGATGTAGAACGAGTAAAACACTCTAGTGGATTAGGGTGTTAACAGCCTAC

CCTTTCTGGTCGCAGCTAGACCCAGAAACATCTGTTCCAGAAACTTAATAAAGTTAAAGTACGTTTCTGGACAAGCCATTTCTGGAGTAATCTGACATTG 

GGTCGACCCAGCGGTCCACCCTCGGAACTTGACATCAGGCCGACCCAGAGGTCCACCCTCTAAACTCGAGCGACACAGAAAAATTTTTAAAAAATTTTTG 

AGATGTAGAACGAGTAAAACTCTCTAGTGGATTAGGGTGTTAACAGCCTACCCTTTCTGGGCGCAGCTATACCCAGAAACATCTGTTCCAGAAACTTACT 

AAAGTTAAAGTACATTTCTGGAGATGTGCGCACGTATGCATGCATGCATACGTACCTATCTGTATTTTTTTAAAAGTAGGGGAAAATGCTATTTAATAAA 

ACAACCACCATTCTCCGCTCACTATACAGCTGACCTTCGTTGAGTAACCATGTCTAGCGGAAAAGTAACCTACGAAATCGTTGAAGGGGGATTGTTGAAC 

AACAAGTACCTTCTAGATGGAGGTGCAGCAATCTGTCTGCAGTCTAATTGTGTTGCAAGAAAACGTCACGCCGGTTCCCTCCACGATAACCTCTTCAAGA 

TGCTAGGATTTGGCGACCCCTATAAACAGAGACGGGGAAAAACAAACAGCAAAAATCTGGCCATAATTGAAGATAGACCTCAACTCGGGTCAGTATCAGT 

TGTCCAACACCCGACAGAACCAGAAAGGTTTTGCTCCATGACATTCTTATTTGCTCAGTACAATATGGGTAATGGAAGAAAATGTTACTTCCCTAACGAC 

AAAGAGTATGTTGAGAGCTGCAAGAAGCACGAAAGGGTCCACAAATCTTCCACAGAAATGAAAAGATTGCGCTTGTATTACTTTAACAAGTGTCTTCACG 

CGATCGCCAAATCACCTGCAATGAAGAAGTACAACAAGATAATCTTCCCTGCCAGAATTGGGTGCGCGGCAGCTGGAGGAGATTGGGAGAAG 

TTCTATTCGAGATTTCTCCACAATCATTGATAAGGAAGTGATAATAGTGTCTCAAAGGATGTAATTAAAAAATAAAACCGTCGCTGGCAACCCCCGCCAC 
CACCACCATCGAGGCGGGTATAAATAAGGGGCGCTGGCACATGGGTGGCACACTCGCATCATGTCTTCCAACCGATTCAGTCAGCTGAGGGGCAACGAGG
AGATGGTTGGGGACTATTCAAGATGGACAACTGTCAAGAACAGGAGGAACAGACAGCAACAGTATTCCCATAGTTTCCGTCCCCAACAACAACAACAACA
TCAAAAAAGAACATCAACCAATTCTCCTCCTGCTCCACCTCCTCCATTCCCCATCATTAGTTGGGGAGCCCTCGGCAGCTACTCAATGTATCGACTGGAT 
GACCAGTGCAGAAATTGCGATGAAACTGGCTATTACAATTTCCACTCTTATGATAGAAAGAGGGAAAGAGTTCGCTCATTAAACAACACTCCAAGTGAAG 
GCATGTGGCGGCGCACAAGTAGATCTTCCCCCTTCCTTAATAAGAAGAAGGACGrTGACGAAGCTCCACCTCCTCAATCAAACCAACACATGTACCCCCT 
CAACAAGTACAGTTTCCGTGAATATACTCCTTCATCAAAGCTTGTGAATTGGCGAGACCCTTCACAAGAAAAACAGGACAAGATCTTACAAGAGGAAGAA 
GCTCGCGCCCCTACACCCACTCCCCAAGAAAAGGAACCAGAAGTAGAAACTAAAGATGATGTTGTCATCGAG^AAGAAACTGCACCAGAACCAGAACCAG 
AACCAGCCCCAGTTCCAGACCCAGATATTCCCGCAATAACTGCAACTACTACTACTACTACAGTTGCAACACGTCACGACGATTCTTCTACAGTATTTCT 
CAGAAATGTTATTCTGAGTATCGTGTTTTGGTTTCTGGGTGTTTATTCTGCATTATTTGCAAAATGTATTAGATCTAAGAAGGAATAAATAAAArGGTAT 
ATGAAATTTAAATCTTTATTGTTTCTTTCCAATTACTCCATTGAAATTGTCCTCCTTGTGAGTGCGCCTTCTGGCAACACTCCCATGTCCACAAGTGCTA
AAGGTTGTATAGTTCCATCCCTCTTTGGTGGGAATGGTCCTCTATCTGATTGACAAACAAAGGGGAATCTATCCTTCAAATCACCCTCTGAAACAGAAAC 
AGTTTTTTCTCTTGTATAATAATATCTGCTCATTAATCCTTCATATCCAAAATGGTGATGGTGTGCAAATTTCCTTAGATTAGGTCCCCCTACGCTTCTA 
AAGTAGAGGGTTGAAAGATCATCAAATGGTTCTGGGCCTTTAACTC^ 

CTGCATAGGTTATTTCCACACCTCTTGTACAAGCACTCTGAGCAGCTGTAGGTGCGCAAGATTCGAATCCTCCATGGTAGGGTGTCAATACAGGAAGTAC 
ATGGACTCTTCTCATGACTCTAGGGACTCTTTTACTCGCTTCATAGAAACCGACCATGTCTT^ 

CTAACGCGGAGATTTTCTGTAGCCTTTCCTTGCTTATTGATTGTGAAAACTTCCGCATCTTCTTCCAAATCCACATAATCTGAAATAGATAATCTCCTCT 
TCTTTGTACTACCGGCAGTGCTTCCTCCTAACATTTCCATATTGCTGTTAACTACTTCCACCACAACTTCACTCCCATCAGTTGGTTCAATGCTTGCGCC 
TTCATTTATTGTCGCAGAATCGGCCGATATTGTTGGCGAATCTATGTCTGTAAAGCAAGGCGTTGCATACAGGCTCACTATATCCTTGACACCCTCAACG 
TAGGGGGAAAATGTTCCCACGTTAAATGTAGAGGCAGTAACTCCATGAGAAGACTCGAAAAAGTCCACTATCCTTCTAGCCTTTGCACCCTCTTGACCCG 
TTTTTGCTGCATCCCTTATAATGTGGAAAATTTCAGCCTCCGCAGGAGGCACGCCCGGCAAGCGTTCATAATCCTCCATTAAAATGTCATTAGATACAAT 
ATCTGTCATACCAACATCCCCTAACAATTCTGCGTTCCTATCGAACGTACCCTTCCTTTCGTCCAAAACAGCAGCAGGTACCTTTTCTTGTCTTGTGGAA 
TAAGGAGTGTAAGCCGTACTCCTACGCATGGCACAAAATTTATTAGGGGCCAACATAACATTGGAAGATATATCCCATATATTTTCTGTTGCAGCTTTTA 
GGTTTATCTTTCTTCCATTAGTTTTAGTTTCTCCGATAG 

GTTTTTAGGGAGGGTAGTCATGAGGGGAAAACGTGAATGCCTATTTCCTCTGCTAGGTGTTGCCGCTATAGGGrCAGTTTTGTTCCTACCCCCTCCATTC 
ACTTTGGGACAGAATAAACCGTTGGTGCAATTACAGTTCTCCTTTTCTCCCAATTCC^ 

CAAAAATACTCTTTTCTCCAATATTGGCAGGAAGTAGCCGTGTTTGTTCCTCTACATAACTCTTTATAGAAAATTTAGATAACGAAGAATGTGATTGCGG 
GCCCAGGGGTTTGGGGACCATCATTGTAGTGTACTGAGAGAGGGCACCCGAGCACGGGTATCCGAAAGATTGGCCAGTTTCTTCCCTGACACAAGGCGGG 
GAGTACACGCCGTTCAAGTAGTGGCCGGCCGTGTGGGGGCTCAGACCCCAATCTGCTCTGAACCGATTAGGGATCAATTCTGGCCTAATAAACGTTTCAG 
AGGAAAGAGTCCTAACCCTGAGTAGCGACTCAAAGAGAGCCATAGGAGGAGGGATGAAATTGAGGGCTCCTTCAGTTTCCTTGACTTCACATTGTTGCCA 
TAATTCATCCCCTTCATGGAATTTTAGAGGGACTATTCCATGCATGGACCCGACCGTGCCGTTCGAGACTGATTCTTGGACACTCTTTTGAATGAGAAAA 
CTGGAAGGTATGGGTAAAGGAAATCCCCCTAAAACTCCCAAAAATGATGTAGCTTGAGCAGCCGCAGACACTCCCGCAGGAGGAGCGAGAGGCTTAGAAA 
AGGGTCGTGCTGGAGCGTACGAATTGAAGGAGGTTTTAATGTCTGTTGTCGAATTTAGAATAGCTCTAATGTCTCCCTTTGTTACTCGATCATAAGTGTT 
GTTGATGGCTGTCAAGTTGTCGGCCGAACCAGGGTCTCCTCTCGCGTTGACCGGCGATCTGCGTTCGAACCAGGTGTGTTCGGGGAGAGAAGATGCAGGA 
ATTTCCATAGGTCTTACAAGAGGTGAAGATGCACCATTCATTAAAACTCCATCTCCAGTGAGGATTGCTCCTGCAAAAATATCGGGCTTTTTTGATAACG 
TTTTTGCGACCCTTTCGGCCTCGCGCATACCCGTACCTTCTATTCCATTATTTAGTCCTCTTCCCGAGAAAAATGTCCCTTCATTAGAAGCTATGCGCCA 
CGCAGTTGGAGGGTGAGCAAACATTTCATATGCTTCTTTTTCCCCCACTTTCTTAAACACTGCAGATGAACCAGCCTTGTAATGATCAATTATTTTGGGA 
GAATTTGTAGACAATATTTTCTCACTGAATCnrTGCAAAGGTGGAACTTTTCTTAGCACCCTGCGCTGTAACAGGTTGCAGAATAATACCTCCAATTTCTT 
TGAATTTGGCATTGTTTGGTTCATTGTATATTGATCTTG 

CAGAAGTAAATTTGTAACGTCGTCAAAACTATTACTAGAATTTGTAACCATCATAGAAT^ 

GTCAAAAGAGAGTGCTrGAAAATTGTCTCGATGGGAGTGTGGGCCATTCGCTTAGCGGTAGGTCTATAGTGTACCAAGGGCAATATCTTATT*rACTGTTT 
TATCATTTTTAGCAGAAGGTTTTTCCTTTATTACTCTATGGAGTGAAGGAACGACGTATGCTGTTGCGCATAT^ 

CTGTAAGGGGTTATCTTTGTACGAATCGTACCCTGCCTTGTCAGAATCTTTAAGGAAGGATGTTTCTATGGTCGTATCGTTGTGCGCATCCCTTCTGTCA 
GTCTTTACTCCTGAAGGArCGCCTGCTCTCATGTGAACCGTCTCGGGGAAGATTTTACCCTGGGACGCTATTATGCCCACAGCATGATCTATGCTATCGA 
TTCCTAAACCGGGAATATTGCTATGGGGTAAAGGCACGACCCTGAGCGATGCGTTTGGATGTTCAGTAAAATTTCCCCCTCCAGCTCCAACCAGCCCATT 
ATCGTTTAGTGCATCTCCGTTCCAGATGTGAGCGGCCACACTTCCTGTTCCACTGTAGGGCACTACACCTCCTACAGATTCACTGACTGTGAGTTTTCCT 
TGTTCGTCAGAATCTTCAATTCTTGAGTGCCAATTGCCGGCGGCGTCCATGGCAGATTCGCGCAATAATCCTCCCGTAATCAGATGGCCTCCGTTAAACG 
CTGTAATTTCAGGTGCTCTTCGAAAAAGAGAAGCGGCATAAGAATAGCCGAGAGCCGACATCCTCGTATCGTT^ 

AGTACTACCACCTAAAGTTTCGTACCTTTCTGTTATTTGCAGCAATAATTTTGCACCCTCTACAAAGGGTTGAGCATAATCTGTTTTTGCCGCATCGGGA 
AGATCTATACCGGCCACTTCTTCCAGTCTGATTTGTGTACTTTCATCACAAAGACAGCCCTTTGTTCCAGTATCTCCTACAACCCCATACCTGAACATGA 
CAGGGCAGCCTAGACCTTGTTCCTTTTCCTGTATACCACCTTCTTTCTTCTCACATTTTGGGGCGTTGGTGGCAGGGTCTCTCTGCTGGGAGTAGCCAGG 
TTCGCACAAACAATCCATAGTAGAAGTCATACTGCTTATGTCGAAATCGATATCGTCCACATCATCAATAATTGAGAATGTTGTAGGGTGTATAGGTCGC 
CCCACTCCTTTTCCTCCACACAGTACAATATCTGTGCACCCAAGAATGGGCTC^ 

ACGAATCTACACACCCTCTTTTTTTGGTGATATAAGTCGTTCCCTTTTTTCCTCCTCCAGTTATGACTCCTCCAGATTCGTAGTGCAAATAAGCCTCCTC 

GACAGTAGTAAAAGGGTTGACGTTTCTGTAGGCTGCATCTGTAGCAACATCATCGTCGTGCTTGTCGAACGCAGAAGAGGCTGAAAAAATAGTATCGGCG 

AGAGGGACACAAACTCCGTCATATTCATCATCATCTTCTATACCGTATTCAGTTTTCAATTCTCTACTGGATTTGACGCAAGTTCCTCCCCCGAGAGAAC 

GACAGCTAATGCATTCAAAATCGTTGCCTCCTACGCGACATTTTTTAGGAGTTTTAAAACAGTCAAATGTGTTATAATGGGAACTAATTTCCCCT 

TAGAGATTCTACAAAACCTTTCCCTATGATGTCTGACTTGACTCCAGCCATACCGAAGGGGGCGTCCGTATCGACATATAAGGGGTCGAAAGGGTTGGGA 

ATAATTCCCCCATTTTCTTTTAATCCTTTGACGCGTAAAGAGTTTCGAGACATGGCTCCTGTATCTACAAAAGAGCTCGCTTAACCACAGTTGCACTCAC 

CCGGGTACAATAAGACAAAAATTATAAATTGGGAAGACCGATACAGTCTTTCATGACAAAATACCCCGAGAACAA 

ACATTAAAAATGGTTGCCCGAAGCTCCAAGACCAAATCCCGCCGTGGAAGCAAGAAGAGGTCCACCACTGCTGGACGCATCTCCAAGCGGAGGAGCCCAT 
CAATGAAGAAGCGTGCAGGAAAGAAGAGCTCCACTGTCCGTCGCCGTTCCTCAAAGAGCGGAAAGAAGTCTGGAGCCCGCAAGTCAAGGCGTTAATTCTT 
CCCTGTACAACAACTATGTTATTTAATTGATTTTTTTTCTTCTGAATAATTGGAAATAATAAAACATC^ 
TTTTAACCAACTATAAATCCACATGTGGTATAAAAGTTTAAGGGT 

AAGAAGTGTCTATGAATTTCTTGTAGAATTCTTCCCTGTCTGTACTTGACGCTACATTGGCAAATACCAGAGGGTCTACTAGTACACCTGAGCTAGTGCC 
GGGTTCTATTTTAGACATGACATTTTGAGGGATTTTAACACATAATACAATATCATCATGGTCGAATGTCGTCCTGTTCCAGAGACGGCGATGTAGTGCT 
GAATTTCTATCCACTAAAACTAGACCATCGTCATTTTTCAAATGCTTCTTTCCTTCATTrCTAGCTACAATTTTCATGTAGAATCTTACTCTTTCTGCAT 
ACGACACCACCTTGTCTTGATTCCCAACTATAGCAAATAGTAACGAGGAAGTAAATTCATTGTTTGCACCATTATTATTGTAGAACCCGTTTGCACTACC 
AATTCCTCCACATCTAGGGTCTAGACGTACCAGAAATTGTCCATCGGCTAACCGAGTAACAGGGTCGCCGCTAGGCCATGTGGCTAGCCGAGTCTGTAGA 
GGGGCAACTGGTTGTCTCAATTGAGAATAAGAACCCACTGCGCGTAGGCGGCCAGTGTTTCTCCAGTTCGTGTATCTGTCTGCAGGTTCGTCAGGGTATT 
TATTTGCTCCAAAAACAACAATACCCCATTTTCCTAAAATAGCCGCAAGTTTCTTGTCTTCTTTATTTACCAGAAGATTrTCCTTTTTGGTTCCAGACTC 
GTCTACCACGTATGTAGTATTTGAAGTGTGCACAATTTGGTCTAATGTTTCTTCAGAAACGCCGTCTCCCATGTACATTTTAGCGTACACTTTCTCCCCC 
TGTGTATACGCTTCACTAGGTGCTGGCAATAACTGTCGATGACGTTCTTGTTCCTTATCGATTTGTTCCCTCAACCAGTTTTCTATTGAAAATTCAGTTT 
GAGTATCTGTATTGACAATGTTTGTAGTATCATTCGTTCTTATGGAAGAAATGAAAATGTGCCGAGGGTACTTTATTTCTCCCCCGTAATCAGCGAATGC 
CCATTCAGGTATGCTTGTAGGTAGGGGGACAAGACTAGTCTGTTTCATGAAAGGGTCCCTTATCCCTCTATTAGAAATGGGAGGAAGTGTAGTGCGACAG 
TAGACTATGCCCTTGTTGGTTATGTCGGTGGCGGGTGTGTCCCTTACAGGTGCCGTAAAGGTTGAGGGATCATTCCATATAGCCAATTCGTTAGATGGTA 
CAGACATGTCTGAAAACTGTCCTTCAAAATGCCCGTCCATCGTAAAGGGCATCTGTATGCCTTTATCGTCACCTCCAAATACTTCCATCTTTACACTGAA 
AACCCTATCGTCTCGTGCACCGTCAATAAATATAGGGCGGCTGGTTATTCTCGAAGAAAGAACGTTTTCTTTTGCAAGTATAGAACCAGACACGGGATCG 
GAGACTGTGATGTGGTAGCTTTTATCTTCAAGGACAACATCATCATCATCATCATTGACGATAACATCAGATACGGTCAAGACAGTAGTTTTGTTCCTTG
ACGTGATACTGTGAATGGGTTGGAATGGGTTATCACTCGCTTTCACTAGGCCGTTATGCGAATCTAAAACAGGGAAAACAGCGTACAGTTTAGTAGGTTC
TGCGGGTGCCTTAAATGTAGGCTGTTCACCTCCCAGTAATGCAGTTTGAACGGGGGTAATAATATCTCCACCTCGTACTTCAGTCAATGGACCAACTGAA 
ACGTTTAAAGCACGAGGCAGTAGAGATGTTGTGCGTTCACCGGCATATTCTGTGGCGGGGGAAGAAAGCACAGCCGTTATTCGAGGTTCTCTATTATCAT 
TATCAGTTTTCAAGATGATACATGGTCTTAGTTGGTAGGAGAAGAAATCTGGCCCGTTCGTGGGTCCAGGCCAATCTGGTGCAGTTGCAGCATCTACTCT 
AAAGTTTACGCTAGAACCGAAGCACCCGACTCTCGAAGGTGTATTATGTACAACAACTTTGTTCGTGTTAGCATTCACATCAGTACTTATAACCGAGAAC 
AATTTATCAGATTGAGTAACTTTTGATGCTGGAAATACAATATTGTACCCAGAAGATGTGGTTAAAACGT^ 

CCAACATTTTCCATTGCGCTTCTATTTCACTTTCAGGTACAGACACACATATAAAGGTCCGAGCTGGGTTAGATGGATTTCTAAGGTGGGTTATCTCTGT 
GGCGTTTAGAATCTTGTAACCCTCGGATTCATTTCCCTCCATATTCTTTAACGTTTCATTGTATAGTACTTTGTTTAATACTGGTGTGTTAACAGTGTTT 
GTTTTCGAGTACACGAACGACACTGAAGGTTTGTTAATATCTGAAAGTTGATTTTCGCGAGGTAGTTTAGCTGCTAAATTGGACGCTATTTCAGCTTGTC 
GGGAAAGACGGGGAAAATTAGCAGGGTAATCTGAAACATATTTTTCATTCAATGTATCAGGAGTGTATTCCACATTATCATAGTTTATTGCTrTAGGCAT 
ATCAAGCCTCATGAGGGCTACATTTCTACAAGCAAAAACTTCCGTTATTTTTGGT^^ 

TTTTTATCCTCTGTTACAATGGAAATTTTCAGTCTATTCCCAGTGTTGAAAATACTTCCCTTTGTTGATAGTGTTATACCGTCAGCTAGAGAATTTACTT 
CCACGTGTTTGAATACGGCTTGTTGTTTATTGCCATTGTTGCGAGAGAACGCATTCGGGTCATTGGTGATAGGAGGCCAACCTGCTGGGACAAATTCAGG 
GAAAGTTTCGAATAGAGACGGACCACTATCGCTGCTGCCACCCCCGTCTTCTCCTTCATATTTTTCATCCACTAAATCAACTTTTATCTCTGCCTTAAAG 
GGAAGAGGGACATTATCTGGTAATATTCCGTCAAGAATATGTATCCTCACGATGGTATCTTTTCCACCTTTTTCTAATAATTCTTCTGTACGGATATTAA 
TCATTCGAATTGGCAGCAATGATGATGATGATGAACCATCTCTATCAGGAGCCGATTGAGGAGAAATATCATCACCAATATCATAACCGATCTGCCGTGC 
AGTGCGTTTCATTATACTGTGTAACAATTTTGGATCCTGCACATCAGTGACACGTTTTAATTCCTCAGAGGATATCCTTTCAATATTTTTAGCGAGTTTT 
TTTGACGCCGGTTCTAACCCATTCTTCAAGTTCCTTTTCTTTTTCGGGTCTTTTCTAACAGCAGCGGAATCATCGATAGCCGCTGCTATATTCCCGAATA 
GAGCTTTTTCTATTTTCGCAGGTTTTCTTATGTGTATCACACCAGATCTTTCATCGGGCGTAAAATTGTTCTGGACAATTTTCATACCAATATAGGAAAT 
AATAGAAATAAACACAACGAGAAGCGTGAATTCAAAGGCACCCCACCACGAGCCAAAAATACCCGTGACTGCTGTTrCTCCAAACATTGTTATTGCTGTT 
CTTAACGGCAGGGTTGGTTGTTCGGCAAATCAAGGAAACAAAAAAGTTACCCGGGCATGGCAGGGAATAGAACCCAGTTCGTATCGTCCCTGATTGCTAA 
ATGTATATCAGACGTGGAACAAGGCATGGAGTGTTGTGGCAGACAAGCACAGGATGCATTAATGACTCGCCTAGCCAACTTAAAATTGGGCGATTCTCTT 
AAAGAAACTGATGTTAATTTGGAATACTTGAGATACGCGTCTACGCCCCTCCTTGGGGAATTAAACTACGACAAACAACAATATGCGGCAACAGTTGACA 
TCAACCTAATGGCTCATTTCTCCTACGCTGCTTTGGCTATAGAAAGTATACTGAArrCTATACGGAGAGTTGTAGTGGCTAATCATCAACGTAGAAArAA 
TGGAAAAAAACCTTCTGAACCAATCTCACGCCCTCACCCGCTGGGAGGGGTAGAACCTCCTCTATCGTCAGAGTTGGCAAATGCAATAAGGGACAAGTTC 
ATCAGCATGGGGGCGTTGGACAGATTGAATTCAGCAATAGTGACAGCGGCCTTGGGGGCTATTGCCAGTGAACGTGAACTATTCTTACGTGAAAATGCTG 
TAAACTACATGTACGATGTAGAATTTGCAGAAAGAGATGCTGCTACTACAGATACAGGGAATGTAGTCTATCTTTCCACCAAAATGGACGAAGATGAAGA 
TGACATAATAAAGCGTTCAGAAATATTAGATAAGGTATCAAAACGACCCGCAAAGGAAGGTATAGACTGGCGCCCCACCCCTGACAATTCGTTCCCTTAC 
CAATTGATTTGGGGCGATGATTCTGTAGATGATACTGTTCTTATAGATCTCATCACCAATGCGATCGTGCCTAATATTTTTATGGCAAAATTTATCCTGT 
TCATATGTAACCATTTAAGGGCAGTTATTAGGAGTATGAGGGAAATTTTATACGGTAACATTTCTTCTTCATCCGATAATTATTTTGAGGATGGACGTAA 
ATGGTGCTTCTGGTTGAACCTGTACAATAGACTGGAATGGTTCATGTTAGTAGTTAGATTTGTAATTTTCCTCCACTCAAAAAAGGAGTCCTTTTCAGGA 
GCTGACAATGTTAACGTGAAAAGACTTCTG^TGOTAGTTGTGGAGAGTTTTCCTCCCGTTCTCTTGGACACTGAATGGGTCAAGACTAATATAACGTCAT 
GGCCTGTTATTAATAACAGCAATAATAATAGTACACTCCCTG 

TATTTTCGACGAAATTAACTCCTTGACAACAGCAGTGACCAACCGTATTACCTTCCAGTCTGCAGAATTCTGCACAAAGATTTTGCTCGGGGGAGCTCTG 
GACGAAGAAGAAGCTGGAACAAAAATGCTAGTAAAATCAGTCAAAGAGACGGGAGAAGAAAAGGATAAGAACAATACGTT^ 

TGAAGAACACAAAAAATGAAGAATTGGAAATAAACATAGGCGATAACGATGATGAGACTACAGATGTGGCTTGTTGGGCACGTACTTCCTCGACATCCTT 

TATCCGTAATAGGACATATGCGTTTAAAAAAATATGGGGCCTTGAGGATGCAAGTGATGTAGTCGAGCTGAAGCGAGAGAGTGACGCCATTACATCCTTT 

GTCACCGATAAGAGCAGTCCTCTCCTATTTCCCTATGTGTCCGACTGGAGTTGCTTACTATTACATCCCTGTTGTAAAGCACCGGCCATAATTAAAAGTG 

TGTGGTTACAAATCCTGAAAGATTTTTCCCAGGAAAATATAAAAACTATAAATGAAAAGGTACA 

CCGTTTTAAAAATAAAAAAATTGCTGCCGAACACGTTCGCAGTGTAAAAAAGTTATTAAATACGATA^ 

GAACACTGTATTTGGTTAACGATTTTGTGGAAACAAGTCGTTCAGAACACTCTCAACCTTCTGGAGAATTTTCCCGTATAAAAGGGGAAAGTACTCAACA 
TGTTCACTCACTTGACCCGAGCCTTCCGCAAGATGAACAACCTAGTCAATCGCAGCTTCATTGACGTTCACCGAGTGGTGGCCGAGTTATCCTATCCAGA 
ATTCGAAGAGGATGTAAAGAATCCGGAATCATCC^TATATAGAACTCCGATATCCCTCTTCCAAAACAAGGATATTGTTACAATAGTTGGTGATTACATC 
CTCTCTCCGAAGACGGACTCATTCCAAGTTCTATACCCAATCAAGAAGGTCATCGAACACTTCCCAGTAATCTTCCACTGCACCCACAATAATGCCCCTC 
TCTGGGTACACCTTCTGGACGAACGCCATCATCGCCTGCTCCAGAGCCTGCTGACGTACGAGATTGTGAATGCCAAGTACAGGGGTATTGTTGTCATCCC 
ATACTACAGGCGCCCCATCAACTATCAAACTGGGAAGAGTCTACTGATGAGCAAACTGGCGTCCGTAAAAGTTCTGGACATTTTAATGAGATGTGGATCA 
TACAAATTCATCTCATTAATGTGCATGATCAACAAGAAGAACAACACCAACTTTCT^ 

TGCTCCACATTGCTGAAATGTTCTTTGCCAACCCTACTACTAGCCAACACCTATCCGACGCTAGTAGTTTCCCTGATGCTGCAGCAGAGGACGACAAGGG 

GAAAACACCTGCCCATCTAGCAATCCAAGAAGATAATGCTGATGCACTCCTGTTCCTGATCTCCCTCTACGGCGCACCCTGGTTTCAAGATAACAACTCG 

TACATGAAATCTGCCCTTGAACTCAAGTCTAACAAGTGTGTCAAGGTACTATCCTTTGCAGCTGACAAGTACGAGATTTTACCCAACATTAACAACAATC 

AACTAGAACCAGATACCATGTGTGGAGTGTGTGCAACATCTGTGGAAGAAGATGAAAATGAAGGGAAAACAACAAGTCTTTCCTGGTACC^ 

CAAGCATTACATCCATTGCGAATGCCTGATGGGAATGTGTGCTGCTGCTGGCAATGTACAATGCCCCATGTGCCGTGAGGATGTGGGCGACGAAGTACTG 

GAAAGATGCCCTCCTACAATATTTAGATGGTTAAAACTGGCTGAGAGATCTGAACACAATCGTGTACTTTTTGAAGCAAAAAAGCAAGAATTCTATAAGC 

AGATGGAAGCAATGAAACCTCCCAGAGTTGTTGTTCCTCCTCGCAGGACATTTCTCACCCCAGCCAGAAGAGGCGAACGAGCCATCAGAATCGCAAGAGA 

AATTGCCACCAACGCCATCGCTGAAGCCACAGCTCAAGGAGATGTCAACTCCTACTTCCCTGTTCTCATTGACGGGAGCGGAGAAGAATATGAAGAAGAG 

GGAGAAGAATTCTTCAATTCTGAAGAGGAGGCGCTTGCTTTTGGAAGACCATTTCTGGAAGATGAGGAAGAAGCCAGACAAATACAGATGCGCCAGTTTG 

CTGAACTGTCTAGACGAGGCGTTTCTGTCAATATTATTAACAATGATAATCCTCATCGACACATCTCTACAGTAAATATTGTGCAACCAGTTTATGGAGT 

TGAAAAGTCACCTGCTGCTTCCTTCATCTACAACATGCTCAAGAATGACGTCTTTGAGTCTATACGCTCAAGAGATACTCGAGTTGGAGGAGAAAGAGTG 

CCCGTCATGAACCTGTCCAATGACAAGAGGGCATTATTCCACGCAGCTTCTTCCATGCTTTGTGACTTTGCCACAGAAACAAACTCTCAAATTGTTGGAT 

TGGACTTTCAAGCAGTGTATGATCCCCATCACATATCCAACTATATCGAGACGTTTGGTAGTCCTCTTCACGCCTACCCAGGAGCCGTCACTTTTCTGGA 

CGGGGCCCAGGACTATTATGCAGAGAGTATCAGATACGACAATGATATTGTCTCATTCTCAGAAATGGCAAGTGAGTTGCACATCACCGAAGCATTAGAT 

GTCTTTGAGGGTAGTTTATTATCCCCACTGTTCAAGAAAATCAGGACTGGAAAATCTTACTCTAACTGGAACGACCATTTGAGGCGTAGAAATTATGCTC 

GAGATATTGCTGAGGAATTTGTTAGGGTATGTGAAAACTCrCTAGCTTCACGCGAACACCCCCCTGTTCATGTACATCCCTTTAGAGATGGAGCAATCCC 

CATTCTCATTGAATATATAGTAGATTTCATCCACCACTGCATCACCTGGTCTATGCAAGTTAATGCACTCCATTGTATGAGAAAGTACATTGAACACGAG 

AATACAAATGTGCACCTGTTAAACTTGCGTCCTACTGATGAAAGGGTGGAAGTTTTAAGGGTATCT^ 

ACAACACTAGAATGTCCCTCAGCACCAAAAGATTGAGCCTCATGAAGATCTTCAACCATGATTTGGGTGTGTCTAAATTTGGTGTATACAAACTCCTAGA 

TATTATTGAAATGTACTGTTTTACTTTAATCTAAACAATAAAAATAATGTAAAGAATTCCATGTTTTTCATCCCTCCTCCTCTGTGGTGTGTATATATAA 

GGCGGGTAGCCTCCTCCACAGGCACACTTGTCCAGTAGCCGTTGTCGAGTGAACATCATGGCAGCAGACCTCCTAGAGTTGGCTATCCAGGAAACAATCC 

AGTCTGAATTGGAAGAAATTGCCGATACTGAATTCCTCAATTATCTTCCCCATAAAACTGGCATCTGCGAAGAAGCTGCAGCTAATGGACGGCCAT 

TCCTACACTAGAAATGAGGAACGAAGTTGACCATTTCTGGTCCCAAGATAACAGGAAGCTGAAACTCTTGGGGCATTTTTGTGGCAACTTGTATGTGGAG

GCATTTATAGCTGGTTCTATAGATGCTGAAACGTGCGTGGGCTTTTTGAGGTCGCAAGCAACTGGACTAGGATACCCTCTATTGAAAAAACTAGCCCTGA

TTGCCCGTGAGGATAAATCAAATACAACTAATTACAACTTGTACATTGATAGAAATTCAATGATGAAACAAGTTTTTAGTGCTGAAATTGATAAGCG 

TTCATCAATACAGAACACTTCTCACACAAAATCTTCCCCCGTGTACTTGAAACTGATCGACAGAAGAACCGAGTGTCTTGCCCTGGATTGGCTGGACGCA 

TCCAAGAGGACGGCCAAGGAAATTGGAGCAGCCAGAAAGGTTTGTTTCCTCCAGAACCTGATTGTTGCCATATTAATACCTGCATACACAGAAACGTTTG 

TTCTTGATACAGGCAACGAACTAGAACAGCAAGTATTGGATGATGCATATTTTAATGCGGAGAATAAAGATAAAGTGGATGAAATGTGTGTAGTGGCCAT

ATTGAGTACTTTGCACAATTTATTTGTTAGGAAAAGTCTTCCCCATCATTTGTACAATGCACCTTTCCGTCTTCCTCCCTTTGGACAACACCCTATCATC

AACATTGAAAATTCCTCATTCTTTAATGAAGATACGACACCTATTCTAGCGTCAATTTCTATACCATCAAGTATGGTAATAAAACACCATACGCGAAAAA

ATAGTAGATGGAGATGTCCCAATAACTTGATGACTGCAGCAGAACGGTCTATATTTTTACGTGGAGTTTTAACAGTCAGTGGAGATTACGGATGGTTTTC 
TGTAATTGTAGGATCTACTATAATGCCTAGTGTACTTTTCTATGGCGACCGAAAACATCTAATAAATACAGTCAAATCTAATAACTTTTCTGCCATAACC
TGTTCTTATTGGAACAAGTATATGGATTGCAGATCTTATGGGTTTGAGATTATAGACACACCTGAAAATAACTGTGGTTTTCGTATAAGGGCTGCAATTG 
ATTGCTCGAACACAGATTTCCATTCACCGGTAACGCGAGTCAACAAGAAGAAAACGAGCATTATTAATGCGGTAAAGAACCCTTTTTTTATTAGACACAC 
AGAACCTAAGTGGTACAACAAAAATGCCATGTGTGGTGAAGTATTGGAAAATGTTGGCGTGACCCTCGAACAACACGTCCGTGTTAGTGATGAGTATATG 
GACAGATTTGGTAGTCTATTACTTGGACGAGAAAAGAAATGGACGTGTAATTATCTAGATAGAATAAAGTCTCTAGAAACTATTTCTAACAATCTCAAGG 
GTAAAATTGACACCATGTGTAAAATTCTAGAAACCAAGTATAACTACAAATCTTCTTCCTTATACTATAAGCAAATAACTGCTACGTCTGATGATCCTAT 
AAAGATGAAGATTATCGCCTCTATAAACAAGAGGAGGTATTTGTGTAATATTCTAGAATTTGCTATAATTTCATCAGAGAAAAAGGATGAGGTGGAGGAA 
GATCACACTAAAACCGGCAATGGTGGTTGTGCGTTTTCAAAGTATAAGAAAAAACAACTCGAACCTAAACAACATTTAATTGTTAAAGTGAATAAATACA 
TTGAGGCTTTTTCGTTAATTAAGATGCTGAGAAATGATTGCGAACGCAACAAGTGTAGGTTTAAAGAGGCTGAAATTAGAGAGTGCGCCAACGAACTGGT 
ACGAGAACTGTATAGGGCTTCGGCCAGAAGTTATGTCCACGATCTGGTATTGAAGCGAACTAATGTCCACTTGACATGGCAACGCCCTTACGATGAAAAC 
GCTAACACTATCATGTCTTTAATACCTAAATGTAAGCTACACACAGTATTGTACGATAAAGATTCGCGCGATGTTAAGTTGTTGAATTTTCTGAGAACGA 
GGGATGGAAATTATAACCCAATAAGACACTCTATGCTGGAGTTAGTATACGGAGAGGAGTACGCTAAAGATGTCAGTACTGTTACCTGTTTTGAATGGTT 
AAAATGGTGCTCGAAAAAAGGTGTGATTAAATATGAAGACTTTTTGGATCGTTACGAGAAAACGGGGGAGGAAGATAAAGACGAAAGGGAATTCTTTAGA 
CTAAAAAAATGTAGTAGAGATCACACTAAGGATATAAAAAAAATAGAAAATGTACTAAATTCTGATACACTTTATTCTTATTCTCTCGATAAAAATGTGC
AAACCCACGCATCTTCTAGTACAGTTGTAAAAAATGACACTGACGGAAAAACGTCTATGGTGGGCTGGGATTATATTTTTTCAATCGGTAAAGGAGAAAA 
AACAACCAAAAAACGAAAACTGGAAACGATAGATATATCGAGTAGTGACGACGACGATGAAGAAGAAG 

AATAACTGCAGCAGCAGCATCAAGAACAAGAGCAAGAACAAGAATGGGAGAATGTGTTGCACAGATATTCTCAATGrTGTAGAACCTTCTCTACCTAATA 
CTTTATCGTTCAATTGTGTAAAAAGTATGGATGTGTTGAATTTGTTATGATCTTAAAAATAATAAAAATATTG^ 

ACACCTGAGCGGTTTGGGCCTGTTGACGTGCCTAACTrTTTGTCACCCTCATGAATACAATTTGTAAAGGTGCTGAAATGTACTTGTTTTTTATC 

TrCCTGTACTGAAGAATATTGAAAGAAGACTTCTTGAAGAGGACCGATAAAAAAATGGCCACCTTCCAGACTGACGCCGATTTCTTGCTGGTGGGGGATG 

ATACTAGTAGATATGAAGAAGTGATGAAGACTTTTGATACTGTTGAGGCAGTCAGGAAGAGTGATCTAGATGACCGTGTTTACATGGTGTGCCTAAAGCA 

GGGATCTACTTTTGTCCTCAATGGAGGCATCGAAGAATTGCGTCTTTTGACTGGAGATTCAACGCTGGAGATTCAACCCATGATTGTGCCAACAACAGAA 

TAAAATAAAGACGGTGACGGGAGACTAATATCTTTCTTAGTTTCCCGTCACGGTGAAAATGTTGGTTATTTCTTCCCTATGTTTAAAAATTTOT 

TAAAAAAATAAAACGAAAACTGTCAATATATTGTTTTATTGATATACAATATCCCTTTTTACACAGAAATGGCATGTCTCTTTACACGCTGTTGTTCTTG

AGTGTTGTTTCCATCTTCTTCTTCTTCTACGTCGCTATCAAATTCATTGTCGTTTTCCCAGGTCCTTTTTACTCCCCTTTGCTCCTGTTCACTGGTAGAC 

TGTTCTCCATTTTCTCGCTTCTTTTGTTCCTCCCTCTTCTTTACCTTCTCAAACCACTTTTCGGTCTTGTTTCTCGTACGAGCTTCCTTTGTGGTCAAAT 

GTCTTTCCTTGTTGTACCTAGTGACAATTTTTGATGTGTTCTCCATCATGGAATGGAGCTTCCTTTGTTGTTCAATTTCAAATTGGGTCATATTTTCTTC 

ATCAAGTTCTTGAGGCGAAGAGATGTCTATCCTCTGCCCATCTTGATAGACGGTGCCATCATTCATAAAGATTCGGATATCAGAACCTGTCACTTTTTGT 

ACAATACCCTTCAAATCTTCATCACTTGGAGGGGTCTGGAAAACTATACTTGATCGGTTACATTTTAATACCTTGAACTTGTTTTGGTGTTTGATACTGT 

CGTTGCAAGCGATAGGTGCGTCACAGCCAGTCACGGCCATGATGAGTCTGGGCCTCATACCTGCAGTATCCCTTTTCTTGTTTATGGGGGAGAAAATGTA 

CCTCAAACCATTTTCTGTTGCAGTTTCAAAGGAACCACTTTTGTCTGTCCTGCACAACAAGTAGAATTCTGTAGAGGATATCGAGAAGACGAGACAAGAG 

CCGCTGTCGTATCTGAAGGTCGCATTGGAAGATTTCTTGTGTACATTTCTGAGAGATTCTTTTTGTTCCTCAGAGACTTTGAAAAGTTGTTCAGTTTCGA 

TATACGAGTTTTTGTACTCCTCTTTCTTTACTACTTGTGTTTCTAGTTCCTTTTCAACTTTTGTGGTCTTTACTACTTGTGTTTCTAGTTCCTTTTCAAC

TTTTGTGGTCTTTTCGACCTGGTTTTCTAGTTCCTCTTTTATTTGTAAATTTTTCCTAGCCAACTCTTCGCGTATCTGGCGTTGCTTTTCATCACTGTAT 

AAGATAACAGCGTAGGCTAGTGTCGAAAGGTACATTTCAATGCTTGCAGCAGAAGGGTGTGCGCTCCTTGCAAAAACGCCCGCCACCATAAACAAGGGAA 

CAATCAAGCAATGTGAAGAATGGCTCTTGAAAATAGGATTGGCTATATATTCTCCTTTGTAATGGGTACGGTGACAAATTTCAACCAATGGGACCATCAT 

ATCCTTGAAACCATGTTTCTGCATAGTTCTCTCCATTGACTTTAAAGCGTGCGTGACGTTCATTGTACTGAAACCTAAACCGATTAAGAAATCGGAACTT 

ATTATCATACAATCGTGTTTTTTTTCTACTGCTTCAGAAAGACTATTGCTCATGTACCTGTATAATTGACTGAATTTAAAGTCCTTGTCTTTTCGTTTTA 

ATTTTTCCTTGACTATTGTATTGATTTTACGCACGTCTTCTTCTTGCGTTGCAAGCCATTCCTCGTCAGTAACGTGCCCAGCATCATCTTTGAATAAATG 

TCCAGGTTCAGGAGGGGCACAACTTATCCGTTCAATTCTTAAAGAGTCATCTTCTACCAATTCTTGTAATCTTTTTAACCCTCTTGCGAATTTGAGTTGT 

TTATCTTTCTTCACATGTACAACAAATGTGTCTTCGGACAATTCGCTTTGTCTTATAGATTCGATAGCTTCATAAACACCCATGACTTGCTCATATTTAC 

TTGTTTTCTTCCCGATCATCAGGAATGACGATCTTAGTTTTAACTCCTTTTCTGATACACTTTCATTACTAACAGGGTTATTGTTTTTCTCTCCAATGTA 

TTCCATATTTATCTTCTTCAAAACAACAACGGAAAATACAGTTTTCTTTTAAGAAATATAGGTTACAATACTGTGGATTGATAATTTTACTCACAGCTTT 

GTTCAAAACTTAACTACGTGGTTCGGCCCATTTTG<^CCAGATCTGTTTTAAAGGAACTGGGTAAAATACACATGAAAACACTCCTTTTGAACAAATTCA 

GATTTATATGTATTTGrCGATTTTTATTAATATTAATAAAGTATATTTTTAAAAATTAGGATGGGGACAGATGGTGTAGATTGATACAGTGACCGTCCCT 

GrTATTTTACCTTAATTTTTTTTCCCTATAATACAAATAATTGGTATCAGTTTAGGAAAACATATTTTTAAAATGAAGGTTATCATATCAACTAAACACT 

TTTATTTACCCCCTTTTCACCTCAGAAATATCCTCACATAATCGCATGCAAGACTTGCGCATTTCTTCCCTGCCGTACAAATCAATAGCAAACGCTAACG 

CCAACAGATACGTCTCCACTTCTAGAGATGGGGGTTGAACATTACGGGAAAACATGTACCCTAGATACACCAGTGGAATAACTAGGCAATGGGAATGATA 

ACTCTTGAAAATGGGGTTTACAACAATATCACCCTTGTATTTTTTGAGATGGCACACTTCGGCCACAGGCACCCATGTATATTTCAGACCACTTTTCCGC 

ATATTTCCCTCTATCGCTATTAATTTTTCCATGACATCATCAGCACTAAAACCGAGACCAATCAAAAAATCTGTGCTAACAACAAAACGTTCCTTTTCCC 

CGAAAGGTATTTGAGGAAGACTTTTTGTTATATACCTGTAAGTTGGACTGAAATAAAACCCTCTAAACAACATCCGTGTCTTTTCCTTTACGAGCTTATT 

TACTTCCTGTACGTCTTTGTCTTGAGAAGCAAGCCACTCGTCATCAGTAGTTGTAGACGTGTTTTTAAGTATAACGTGTCCTTTCTCAGGCTTAATAGAG 

GATAATAATGATTCTATCTTGAGGGAATTGTCCCCTCTGAGCTCTCTTAATTCTTTCAGTCCCCCTTCAAAAGTAATGTGTTTATTTTCCTTTAATTGTA 

CAATGAAAACGCCGTCTCTAAATTCACTCTTTCGGATAGTTTCAACTGCTTCAAATGTACTCATTACTTCTTCATACTTTTCATATTGTTTTTCTCCAAT 

CATTAGAAAAGAAGAGCGTAGCTGTAACTCGTCTTCTTCAGGTGTCTCATCCAGGAGATTTATCAGTTTTTGTTCCCCTATATACTCCATGGTTTATTTA 

GTCAGTTGATTTATTTTTCTCTTTGACACAGTTTTTATGGTACTCTGGGATTACTTTACTCAAAAATTTACTCACATGTTGTTAAAAGTTTGAGTGTATT 

CTTTTGTGGTAAACTAACAATGAAGTACATTTCTTAGTAAATGAG^TAGGTACTGATTCGTGATGGATAACGATGGCGTCCCATGTCTAGTGATGTGAGT 

CACATAATAAAAAGGGAAATAAAAATAAGAACTAGTAACATATTTTTTATTTTATCCTTCTCAATATGTTCTTTGATTTCTTTCTAAAAATGGCTGCTGC 

AAAAATGGACGCTATCCTTGCGGATATTAATGGGAATGATACAGATTTATCC^tflGCTA^ 

AGAAATAGGGCTAAAATGGACATGAATAGAAGAGTAGATGAGGCTATTCAGGAAGCCGTAGCGGCCAAGAAACAAAAAGCATTAGTGGTATTTGATAAAC 

TCGTGGAAGAAACTGACAGCGGACAAAGTGTCCCTCCAACATTATCGGGATCCGATTACGACGCGTGGGTAGACAGAGCCATGCCTTCGCATATTGAACT 

TGTAGAGAGTGTTGAGGGAGATTCTTTGTATGATAAACTCCCTCCTTTTAACGTACAAGACATAGACGACCAAATCGGT'GATGAGATAGATACACCAATA 

TCTTACCTTGCCATGGTAGTGGTAAAAGTCGACTGTGAAACTGGGGATATCGAAGAAGAGTACAATCTTGCTCCTACCTTTGGTGTGACACAAAATAATA 

AAATATACAGAGATGAAAGAGACCAGATTTTTACAAAGGCTGATAAATCTGTGCGTATTTTTAAACTTGCTAAATTGGATAGTATATCAGGTAAAAGTAG 

ACAACTGACGTATGCGGTAAAAAATAACAATGAATATACAGAGTTTGTCTGTAGCGTCTTTGCAGAGTTTGAATCTGACAGTGACACTACTAAATCAGGT 

ATAGGTATTCGTGAATATGACAAACCTAAAAATGAAT^rcGAATATGAGGAGCGAGAGATCTTTACATTTTTTATTCCAATACAACCCGCAGGGACGAAAT 

TATTGTTATACTTTTTGGTGGACGTCAGGTCCCGTATTATATAGGACTATAAGGCACAACACTCTGTTTGAAGAGAAAAAATGTTCGTCATAAGCATAGC 

TACGTCATTGGTGTTATTCTTCTTCCTTCTTTTCGTCTCAATAACAATTTTAGATGGTGCAAAAACAATCGACTCTCAACCTTTTAGGAAAAGGAGGAAG 

AGGAAGAGATATCGTACCAGTGAGAGTGGTGACGGCATAGACGGAGGAACTGGAACAACAAACGGAGGAGGAGGAGGTGGAGGAGAAGGAGGTGGTGGTG 

GAACAAATGGAAATGGAACTGGAACAACAAACGGAGGAGGAGGTGGAGGAGAAGGAGGTGGTGGTGGAACAAATGGAAATGGTTCTGGAACAACAAACGG 

AGGAGGAGGTGGAGGAGAAGGAGGTGGTGGTGGAACAAATGGAGGAGGAAATG^AAATGGAGGAGGAAATGGAAATGGAAATGGAAATGGAGGGGATACA 

GACACAGACGATTT^GAGCCTACGCCAGCCCTTCTAAAAGAACGTCTACTTAATTCCATATCTTCAAAACCTAAAGAATACTATGAAGCATTCGTATCTG 

CAGAGGTAGAAACTGCCTTACAATTATCTCGAGATGATTCTACCCAAACTATAATAATTGATGACGACCAATTAGAATTGGACGCCTCAGACACTCTACA 

AGGAAAACCAAGGGATTATTTGTTCAAGCTCGCAGGTGTTTCATCGGCCTTTTTAGAAGGTACCACTATCCGCAAAGCGGAAGACCGTGCCCGTAATATA 

AATGAGGAAGAAATTGCACAAACAATATTAAGTCAACTAAGAGAAAAACACATCAACGATGAATACGATGGAAAATATGCCACACCCGAGGAAAGAGCTG 

ATTTTTCCAATAGTCTTAATCTTGTTACTAAATACACAAACCATGAAGTGGGTCTTTTAGTAGGAGAAACTATTGAAAAGGCTTTCCCTCATG 

GTTTGAGAGATGCATAATTCTAGTAGAGGATTTTAATAGTGGAACTATTACTTCAAACACTATGCAGTACAGGTCCAACGCTTACAAAATCAGAGTAGTA 

GAAGGATCAACAACAGATCCAGGGGAAGTTGTCCCTGATGATTGTTTGGTTTTTGCCGTAGTAGTAAATAAGGAACAACATTCTCTAGAAATATCTGCAA 
CTAACAGATGCCAAGACATATGCTTTGTAATTATTCCTCGTTTATCCGCAATAGGAAAAAATGCTACCATGGTAATAAGGAAAGGTGATGAAATrAAACA
AGAAACCTATCTGTTTGTGGCCAATAAGAATGACACTACTCATTTTTCAATCATCACAGACAAGGATGAATCTGTTGGAATAGAATTAAACATGCTCATT 
TTCTCAGAAAGAATTCTCCCCACTTTGAGTGACCCTGCAACCGTTCCAAGACCTTTGACTGACGCCAACGTACTGTCAGCCTACGGAAAGCGCCTAGGTG 
TTGGTGCCTTTACAGACAAAAATTTATTGTCCAGCCAATAAAAAAAATATAAAAAAAGATTGATTGTTTTATTTTCAAAAAAAATAAAACACCATGGGAG

^ ATAAGCAAAAGGTGGAACAGCTTTTAAGGGAATTAAAGGCAGAGGCTAATGATGATTGGCTATCGGCTAATGTTGATCCAATTGTTGAGAGATTTGTCAC 
AACTAAATCTGATGAGACAGCCCAAGTTGTTAAACAGGCCGTTGATGAAAAATACGATGAATTATTAGAAGATAAGGTTGAAGAAATGAGACCGGATATA 
ATCAATGAAGCATCCGAAACATATGATAAACTTGCTGCTGATATGATAAGAGAGGTAGACACTAGTAGTGTTATTGCTCCTGCAATAGCTGGCACAGTGG 
CAAGAACTATCAATAATTTAAGAGATAAAAGGAAAGAATATGAAAAGAGGCTATGGACATTAGCCTACAAACCATGGAGAAGATATGTACAAGCAATTAC 
AGTGATGGAATTTCGTTTATCATATAAAGACCTGACTGTCCATGCCAATTCCGATACATATTTAACATTCCCTTTTTTAAGAATAAAAAAGATCGCCTAT
ATTAATAACGACCGCGCCAGCCCCGTGAACTGCTCCTTGTCAGTGTCTTATCCCAACAAGTCCGAGTGGGGAAATGATAACGGAGTTGGACGTAAGGTTG 
ACATACACATCAGAAGAAATGATTTACAGGAAAAGGATCTTTACCTGTCTGTTATTTGTATGTTAGATACTGATTTTTCAGGATATGATAAAGCTGTGGA 
AGTTGATGCCCATAAATTTCATTTTGAAGCAGGGAACAGGACTATGTTTTTACCCAAAACCTCTAATCTATTrAATCGATCGCATATAGTAAACTCTAAA 
ATATGCACTATAGTGTTCCCTCCCGCTTCTGCCAGTTCTGCTTCTACAACAGAACTGGATAATGTCTATTACAGGATCACATGCACATGCTCCTAGAGGG 
GGGGGGTGGGTGTGTCCTTGTATAAAAAGCGAGGTTGGCACAGACTATCAAACATTCAGCCGTAAACCATGGAGGAAGAAAGTCAACGAGTACAGAGGAG
GATCGGAGTCCTCCCAGAAGAGGCTGCCTCCCAGATCCTGAAAGACACCAAGTTAAGAGTATCATATCTGGGCGTGGGACATTGGGGATACTCTGTTTCA 
GTTATAAAAAGTGCTCTCCAGAAAGGATGCAGGAGGAACGATGAGGATATAACTGCGTGGTCAATCAGGGAGGCTTATCTCTATTACCATCTTGGATTGA 
ATTACATTGJWVACGTAAAACCTGCTGCAAAATCATTAAATACAAATATGGTAAACAGAATTAAAATCATAGCTGTGGAGGATACCAGTCCTAGAAGTAT 
GGTGGCTTCTAATGAGTGTGTGAGAACTCTAGAAAAATATGAAAAGGGGAATTTTAGGCAACCCAGTTACTTGATGGATGCTGCCATGAGGCTAGTTCAC 
GCCTCCAGTTCTAGAGTGTGCTCCCACATGAGAGCTTTGTGTTGCAAGGAAGAGGACAGTGATAAATTAGGGGGTATTTATTATGCTAATTTTAATGAAC 
TTGAAACTCAGTGTGTTAGTGCAGTTAATTTTTCTCCCATAGAAAGAATTAAACACGTCTTCAGGGAGATTGAAAGTGTAAAATTGGGGAAGAAAAGTGT 
ACAGTTATTAAATTTAAGAAGTGTAGCAGCTTACCATGTGTTAAGATATTATGGGGATAAAGTGAAAGATACAAATAAGAAACATAGTGGACCATTCAAG 
CGAAAAGAGTTTGAGCAGTTTTGGGGGTTATGCTTTAAATTTGTTACTCAGCACGTAAAAACAGACCCTGAATTACGTTGTTATTTTAATGAGTTGACAT 
ATGCCATAAATTGGAGAAGGGATTTTTTCTGCTCAAAAGGGTTCTTTAGAGAAGAAAGTCTATTCCTTACATCTATTGTGGAATTAATTATAGCCATGTG 
CATAGGTGACCGTAAACAGTTTGCCAAGATTCAGAAAAGGGATTTAAAACGTTTCAATAAAGGAGAAGAAGGGAGAAAAGAAGAGGCTGCTACATTTGAT 
TGGATAGAAGGACATGTTAAACGGATGCCTCAAATGCCTGTATGGGTTCTGGACAAACATACGAACAAAAACACACATGGAGTATCTTTTGCCCTAGAGA 
GTAGTATGGTTTCTGGAGGAGACAAGCGCTGGTCCCCAGGGGTGTGGCTTCATTCTTATACAAAGATGGGTCTAGATTCTCCCCCTCCCCCAGAAGTGGG 
CCAATTTCTGGATCAGGCTTTTAATACATTGAAGCGAGAAGCTGCTAGTCATTGCGTAACGAGGA^ 
AGTTTTACTGCTAATATTAATTCTGAACCGATGGAAATTAAGGAAGAAATAAAGAAAAGAAAAATTGAG^ 

CTGTTAGTGCTACTACTAGTAGTTCTATTACTTCTACTCCACCTCCTACAAAGAAACAAAAAACAACTCCAAGTGGAAGCAATAAAGTAGACTCTATACA 
ATTGAATAATCTACCAACTTTAAATATGGAGGATTTAGATAGAGTACTAGAAGTACACAACCAAAACTCTAAAAAGGGTGTAGCTGCTACAGTTTTAATG 
AAGGATGGAAATAAAGTTGTGTTTAAGGAGATGAGAAAAAGCTTTGGTTGGGGTTCTCATCAGAATTTTGTTCAAGTACTAAAGGATGAAGATGTGTGTA 
AATTGGACTATTTGTTGCCATGCCCTGATAGCGGACCATACAGAGGATTATACAGATGCTATTTCAAAATAGTAAAGGATGAAATTTCGAGTACAGCTGC 
TAGGATAGAAAAAGTGAAATGGGGAGAAAATGCCATGTGTTATTTCATCTCTGGGTGTGTGACTAGACAAGAAGGAATTGGGAAAATTATAACAGATGTG 
CGCCTTTCAOTTATGGGACCAAATAAACAATATGTGTATGATAACTATAGGCAATTAATACACATTTTAATTTTTAGATTGTTGACTGGTGTGAGTGATA 
CCAACACATCAAACATTCTCGTAGGTGATGGAGGAAACTTGTTCTCTGTGGACGAGAATTATGTGGGCGCAAAGGATCCGAGGACAGCTCTCGAAAATAG 
AAAGATAAAGGAACTACAATTGCTACTGAAGACTTCCTTTAAGGTAAACAAGGTAACAAAGGAAGATATAGATAGCTGTCTTCCATCTTGGCTATTTGAT 
ACATCAAAAAGTGATAAAATAATGAATGGTGTATGTAATATTGGTAAAAATATGGGAATTGGTCCCACTACTTTGGATATTGTAAAGAATAATTGTACTT 
GTATTTTAGGGGTCGTGAATGACCTGCTTTATGACAACAAATAAAGGAATAATACTTTATACAATTTTTCATTTCCTTCCTTTTTAAAGTGTATAAAATT 
TTTCTAAGACAAAAGTGAGTTTGAGGGTTGGTCATAATCTAGCACATTTCATCCCTTTTTCCCCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTATTTGAA 
AGACCCTGGTTAGGTCCCTGCTAAGGGTAGGCTGTAACCTACCATATACACTAGAGAATTTTACTCTTCCTCCATCTCAAAAACTTTTAAAAAATTTTTC 
TGGGTCACTCGAGTTTAGAGG<;TGGACCGCTGGGTCGGCCTGATGTCAAGTTCGGAGGGTGGACCGCTGGGTCGGGCCAATGTCAGATTGC^CCAGAAAC 
GTCTGTTGCTCCAGAAACGTACATTAAATTTCTACTGTTTCTGGAACAGGCGTTTCTGGACACACCAGCACCCAGAAAGGGTAGGCTGTAACCTACCATA 
TACACTAGAGAATTTTTACTCTTCCTCCATCTCAAAAACTTTTAAAAAATTTTTCTGGGTCACTCGAGTTTAGAGGGTGGACCGCTGGCTAGGCCTGATG 
TCAAGTTCGAAGCGTGGACCGCTGGCTCGGGCCAATGTCAGATTGCACCAGAAACGTCTGTTGCTCCAGAAACGTACATTAAATTTCTACTGTTTCTGGA 
ACAGGCGTTTCTGGACACACCAGCACCCAGAAAGGGTAGGCTGTAACCTACCATATACACTAGAGAATTTTTACTCTTCCTCCATCTCAAAAACTTTTAA 
AAAATTTTTCTGGGTCACTCGAGTTTAGAGGGTGGACCGCTGGCTAGGCCTGATGTCAAGTTCGGAGGGTGGACCTCTGGGTCGGGCCAATGTCAGATTG 
CACCAGAAAC^TTGCGTACTACTCCCATGTCCAGAAATATCTAGATCCTTTCTGGTTCTCGCCCACCACCAGATATCTGGGAAGCCTTAGGCGAGTCATG 
TTTCTTGCACCATGTATTTCTAGGAATTTCGCTGACGTCAATAAACTGGTTTGTCATCCTGTTTTATCCAGTTTGCTGACGTCAATAGACCATATTTGGG 
CATCTCGCCCACACAGAAACAGGATATGATATCATAGATTTCTGGGAAGAGGGTGTTTTTTGTGCCGATCTGGGTGTATAAAAGAGCGCTGCGGAGAGGC 
AGAAACATCAGACAGACTTGATCTGTACAGCTAGCAGCAGCAGTAGCAGCAGCCAAGAGAAGATCGGACGCAAACCATTCTCGCAGCCATGGCAGCAGCA 
GCAGTCTCAGGAGAGGGGAGAATCTCTGCAGATCTACTCCTGTTGGAACAACTCACCCCGGACGGAGACGTGATAAGATACGACTCTGAGCAGTACACCA 
AACCTAGGAAGATCTTTGGTGACAAAAGTGTGATAGAGACTATTGGACATTTTCTCATCCACAACCACAACCAAGGTGAGAGTTACCAAATCGCATCTTC 
TGTGTTGGAAAAATTCCCCGCTCTACTCAATTGCATATGGAATGGAGAGTCGGGAGGAATGGCTCTATGGAAGGCATTGTACAGGGCTAAAAAGTATAGA 
CTTCTCAACTCACTTTTGGTCCACAAGATAAAAAATTGGCCTTCGGTTGCCGTGATCCCTATCTACGGGTCAGTGTGTGACAGAGAAGAAAGGCCCATCA 
TCATGAGTGAGATTATTGACAAGGAAACTCTTCAGACCATATGTAAGAGTGATATACGTTCTCTTCTTGGAATGATGAACGCCAAGCATGGCACATTGGG 
AGGTAATTTTTTACACTTCTATGCCCGCTCAACTAAACCGrTTGAAAATTTCCAATATGA^ 

ATTTATGATGGATTCAGAGACCATGGCTTAAACCCATCAGAATATACTTTTCCTGGTTTGGAATCTGCTGATGTGTACGGAAACAATCCAGTGGAAATTG 

CAATATCAGGAGATGATGACAATATGTTGTTGAACCTCATCTGCAACTATGGTGTATCTTATGAAAAAACTCGCGGTCGGGTAAATAGATCC 

TTTTTTAAAAATGAACACAGCTTCAAAGTGTCTCAGTGTTTTAAAGTTTGTTGAAAAACACTTTAAAATTGAGTCAAATACACCAAAAGGAGAATTTGAA 

GAAAAGGCCGAAACATGTGTTAATTGTCTTGATAGAAATAATGTGTTGACTAAAGGGTCTGAACAAGAATCTTACAAGCTTTCTTGTGGACACTTTCTTC 

ATGTAAAATGTTTGAGGAATATTTGTATAGTATCACAACACCTAAGATGTGAAAAATGTCTAAAAAGAT^ 

TAACTTGAATTGCTGGTTGACTATGCCGG^AGGTGCTGGAAATGAAGAAGAAATATGTTTCATGAGAAATAAGAAACTGGTTGATGATTTCAGAAAATTG 
TTGTCCCCTGTCTCAATTCCTCATTTCTTCAAAAATAGTAGACAGCGAAATCrTGATATGTTGTGCCCCTACAGTGACCACACGATAATACCAAATAAAG 
AAGATCCAAAGAAAAACGAAGATGGAAACAGAGTGAGGGTCAACCACACAGCCATCAGTGAAAAGCAGAACAAGGAGGAAGAAGACGCGAGGATAAAGCG 
TGTAGCCGTCAGGACATTTACAGCCATCAGAGAAAAGCAGAACAAGGAGGAAGAAGACGCGAGGATCAAGCGTGCAGTCGACATGGCTGTCGCAGCCATC 
AACGAAAAGAACAAGGAGGAAGAAGACGCGAGGATCAAGCGTGCAGTCGACATGGCTGTCGCAGCCATCAACGAAAATAACAAGGAGGAAGAAGACGCGA 
GGATCAAGCGTGCAGTCGACATGGCGGTTGCAGCCATCAACGAAAAGAACAAGGAGGAAGAAGACGCGAGGATCAAGCGTGCAGTCGACATGGCTGTCGC 
AGCCATCAACGAAAATAACAAGGAGGAAGAAGACGCGAGGATCAAGCGTGCAGTCGACATGGCGGTTGCAGCCATCAACGAAAATAACAAGGAGGAAGAA 
GACGCGAGGATCAAGCGTGCAGTCGACATGGCGGTTGCAGCCACCAACGAAAAGAACAAGAAGGAAGAAGACGCGAGGATCAAGCGTATAATTGACTTGA 
CTGTTGATATGAGGATTCAACGTATAGTCGACATGGCAATTGCAGCTGCCACTAAAAAGGACAAGAAAGAAGAAGAGAAAAGGACAAAAAGGGAACAAGA 
GTTAAGGGCTGATCTGAGAAGGGCAATGGATATGGTGAACGAAGTACAGAAGAAACTTGAAGACATGGAACTAGAAAAGGGGTGTAATAAGGATGAAGCC 
AAGAATACTAGTAATGTTGTTAGTAGCAGTAGTGTTGTTGCCTATTCTAAAGAAATTGTACCTTGTTTAGGAAATAATAATAATGCTGTCATTGGTATGA 
CTAGCACCAACTATTCTGCCAACAATACTAAGAATAATGTATTTGGTTCACCTCATAAATTTTCCTTCAACGATGCATCTAGATTCTCCAATATTGTAGA
AACTCCCAAAATGTCTTTCAATTTCTCGTTCAAGACATAAAATAAAATGTGTAACATGCAATACTGTTTTTTGTAAAACCCTTGTGTCTGGATAAAATAC 
CACACATCTGTGAACATTTCTTGCACACGTAACACTACGACTTTTCTTCAACTCACACATCTATTTTTTGCACATGCAAGTAAAACATTTCTTCAAACCT 
CTACAGAGAGATGGTATATAAAGGCTTTGTATCTCCATCATTTATCACTATTCGTACGATGACGTCCAACAGACCAACAACATCCCCTCTTTCGTTCTCT 
GAAGGCTTCTCGCTTTCAGGGGATAAGTATGATACGTATGAGGATATCTTACTTGAACAATTCAACTGTTTCAAGACATCTTCTCCTTCTTCTGCTCGTA 
AAAGTGAAATAGAGGATAAAACTTTAATCTTTCAACTTAAAGAAGGAGAAAAATTCCATCTTGCAAAGGGCATAGAAGAGCTCCGTGAGATTCTAGACGA 
TAATTCTGCAACAATTGAACCTATTATTTCTCCAACGACATTCAATGACAGAAACGAATTACTAAACCACGAGGGAGATATATCCTCAAGTCCCCTATAT
ACTCAGATAATGAAGCATATTTCACCAGAGCATGATATTTATGAATTGGACCTTATTGTTGGCACTGATTTGCTTTTTGGTCTAGGTGTGAATCTACGCA
ACGTTTCTAAACTGATGAAGAAAATATCGTATGGTACTTTAAATGTAGTTGATCTGTGCCACAGAAAATTTTTCAACAATAGGATTATAGTTAATCCCAT
TTCTTCGTCATTCTCAAAAAATGTGTGTATTATTCCTCTATTTTCTGCAGCTGAAGAATTTTCGTCTCTGGGGGAATGCAGGGATTTATTCAACGGTATT 
TGTGATGACGTAGAGAGATATATCAACTCTTATTTTTTCTACCCTGAAAATACTACTACTACTACTACTACTGCTCCTTCGTCGCCCGAAATGGAAATTG 
CAGATGAGGAAGAACAATCCCCAAAAACTATAAAGAGAAATGACAACGCAAGTAGAAACTGGTCTGGTGTCTGTTTGATTTTTGAAGTATTTAAAAACAC 
GTACTACATTATTAATAGAGGAGATAGAGGAGGTTCTTTTGAAAAGGCTGTGAAGAGTGCAATTTCTTCTATCAAGGAAAAGAGATGTAAAATCACAGAT 
ATTAATGGTAATAAACCTCGATTGGTTATGGTGATAACTGGGTGTTATACAGAATTGTACTTCAAAGATGCACTTAAACAGATTGGAGAAAACAGGCGCA
AATTTTTGAAAATGAATGGGAATTACTTTTCTCTGATTGATGAACAAGCAGATCTAATCGAATTCGCGATGAGTGTTTCTGGTGCCGGGGAGAGGATTTT
TGTTAACGGTTTGGGGATGTTCCAGAACCGTAAAATGATACCTGTAATTGATCCTCTCACATATGAAAATGTTGTATGTGGTGAGCATGATATACAAAAA 
GAAGATGCTATTCTTTCTGTAAGGAGAGCTATTGCAGACTATAATGACTTTGTAAGTAAGAACA^ 

AAGATGAAGATGCAGACGCTAGCAGCAGCAGCAGCAGCAGTCCTCCTCCTTCTTCTCCTCCTGCACATAAAAAATCACGTCTTCCGGATGAAGGCGAAAA 

ATGTACACTCTGTTAATTTTTTCAAACAATAAACTAACCACCTTGTATATTTTTGTTTGTTTGCAACACCCTT^TTATGATACCATTAAGACACGATGGC 

GGTAAACTTGGATAATGTTCTTGTGAATATCAACAACAAGGATGAAGATCTTACAAAACTCGTATCCGAGGCAATAAAGCGGCGAGCTAAAACTG 

GACACTAAAAATCAAGCAGGGTTTGACATGAGACGTCAAGTTGAAGCTGCATTATATGAAGCAATATCCAAAAAGAAAGAAAAGGCCATAAAGGCATTCG 

ATGAGCTCATACAAGAAAGAGGTGATGAAATTACACCTTTGACTACAATGCAGTATGAAGAGTGGGTAAACCGTA(TAATAACTCCCTCATTGACGACTGA 

AAATTTATTAGGTGATGTTGAGCACGCCGATTTTTTACTGGACCGAATGACACCCGTAAGCGAGGAAGATATTGAAGGTTTCGCTGCTTCTACTTTTAAG 

GAGGTATCAGATTCAAAAACTGCAACAGTCATAGTTAAGGCAGATTGTGAAACGGGGGATATCGATGAAGTGTATAATCTTGCACCATCATTCGGCGTCA 

CTCAAGAAATTAAAATATATAGGTCAAACAATTCCTCGGAATTGGATAATGTCGCAGATTCTTTCCATATTTATAAAATTTCTGCAACAGATAGCGACAG 

TGGCAATACTAAAAAATTGTTGTATGGGTTAAGGAATAAAAAAGCAGGTTATACGTGTTTGTGTAGAATTTTTGCAGAAATTGAATCAGATGGGATTATG 

GCCAATACAAATATCGGTGTCGCTGAAAACAACAGAGATGAAATTGATGAAAACGAAGAAGGTAAATATGGTTTTTTAATACCCAAACAACCAGCTGGTG 

CAAAATTGATCATCTACTTCTTTTTAAATTGTTGGACATAAAAAATGATATT1TATACAATGCAACCCTTCCTTGGGTTTCTGGTCTTTTCTGTGCTTAT 

TGTAATTGTAAT^ACTGTATTAGCTGTATACACTGCACCTCAAATAAAGAAATCGAAGAAGAGAAAA 

ACTTTGGAAGATTTTGTAAAGGGTCGGCTCCTTAACGCTGTCAAGGAAAAACCTGCAGAGTACTTTGAGTTGCTAATATCTGCAGACACTGAAGCAGCAT 

TAAAAACTGCCGAAGAAACAGCCCTTCGAGATTTTGTTATTGAGAACGACTCTGTCGAAATAGATGTGGAGGAAGTACTTGAAGAGAAACCAAGAGAATA 

TGTCTTCAAATTGGCAGGCGCAACAAGCGAAACGCTAACAAACACAATCATCGCAGAGGTACAAAAAAAGGC^^ 

ATTAAAATGTTAAAACAATTCAGGGCTGCGAACAAAGATAATAAAGACGGGGAAGCAACTCCTGAAGAAAAG<^ 

TGGGGTTGTACTTGAACGAAGTAGTAGAAAAAACAACAAATATTGTCATTAACAAAATATTCC^ 

TGAAGATTTTGATACTGGTGTTGTGACTGATCAAGCCATTCAGATACCCTCCAACAAATACAAAATCAGATTAGTCGAAGGGGATGAACCTGAAGTATTC 
CCTGGTGACTGCTTGGATCTTGCAGTTTCAGTTGATAAAATAAACCACGTCTTGAAAATTTCTGCAAAGAACGGATGTGAAAACAACTGCTTCGTTATTA
TTCCACGGTTCTCTCCTGTAGGAAGTGTTTCTTCCATGATATTGGGCAGCACTGACCAAGTCAAACCTAAAACATTCTTATTTTTAGCCAACAAAAATGA 
CAGTACACATTTTCAATTCACAATGGATAAGCAACATTCTGTAGGGTGTGAGTTGGACATGTTAATTTTTTCAGAAAGGAACTTGAGGAATTTACCCGAT 
TCAAAACCTAGACCTCTAAGTGATGCAGACATATTGGCCTCATATGGGAAGCGTCTAGGAACTGGTGTTTTCACAACAGAAAATTTGGTAGACGATTAAA 
TAAAATTTATAACCACTTTATCTGTAGACTTTTATGTCCTTTGTGCAACAAAAATAAACATGTCTTCTTCGTCTTCTGAAACTCCTAAGACTTCCACCGA 
TACTGGGGAAGAAAGGATTAAAGACATTGTAAATGCTCTAGATAATAATGGCGAGTGGTTGTCTTCCTATATTGATCCGATTATCAATAATCACATCTCA 
CGAAAAACGGCAGAAACTGTCCAAAAAATCAACCAAGAAGTTGATGAACGGTACGATAGAAAAATAGCCGACAAAATCAACGAA^ 

TTACAAGTGCTCAGACTATGTATGACCAATATGCAATAGACACATTTCAAGAAGGAAAAGGAGCCAACGGGACTGeACCAGTCATGGGGCCAGTGAACAC 

GGTTATCGATACAACTTTAAATAAAATGAGGGGAAATATGCTCGAATACGCTGAAGATATGTGGGACGGAGATGACTGGAAACGATTTTCCAGTTCTATG 

ACAACGCTTGAATTTGATCTAAGTTACTCTG^TTTAACTATGATGCGTGGTTCTGACGGGTATTTTGCATTCCCTTTCCGTGGAACAAAAAAGATAAAGA 

TGGACGGTTCAAGAAAGAAAGAAGACCCAATTAATTGTATCATTTCAGTAACATATCCAAACAAAGTAGGGGATGAGTGGGAAGAGGGTAAAGAACGTGA 

AGTGAATTTTAACCTAGAAAGAGTAGACGACTATGAAAGAGATATCCATGTTTCAATTTTGTGCATGTTACATGCACAACTTGATAATTTCGAACAAGCA 

TTAGGAGAAAATGCAAACTCTTTTTATTTTAAAAAGGGGCAAAGAGTCATGTTCTTACCCAAGAAATCTAAACTGTTCAATAGACCTACTGTAGJ^ 

CTGATATGTTTTCTATAATATTCCCACCTGCATCTGACCAAGATTTTGCAGATGATATTTATTATCGAATAATTGTAACATGTTCATAATAAAATTTGTT 

GAAATAAAATACAGTAATACATTCAAAATCTTTTATTATTTTTCCTTTCTAATCCAACATATATAATGGTGATACATTTACAGGTGATACCTGAACATAT 

TTATAAAAATC^GGAATATACATAAACATGCTATTGTAAGTTGTATTCATATCCCTGGTGATCATCTCGTGCCAGATGCCCCTCAGCGAGATGGAGCTTG 

CTGTAATCTTTGCCCTGTGGAGTGCTTGTTTGTATGCGTCTTGAGCCCTCTTCATCTCATGTGCTGTAGTTTCATCCCCCTCACAAGGCTCCATGATAAT 

ATTCTTTCTGGCCGAATTATACTTGTAATCAATGAGGTATTGCTTGAAAGGCCCAGTCAAGATGTCGTTGGTTGTAGTCTGGATGAGGTCACATTTCTTG 

GCAACATTCTTAAAGGCGGCAGCAGCAGTCTTGGCAGACATTGAATTCGCTTCAATGCTTCCAAAAATATCCTCATTATTCTCGTGCACCTTCAAGTAGT 

TGCCACTATTTTCTCCCACGCCTCCGTCAACAATAAGGTTGCGAACGTATCTGCGGCTGTGTGTCTTGTTCTCAACAACTTCCCTCTTTAGAAGATTTTC 

AACCATCTTAGTGTACATGTCAGATCTCCAATTGTTGTTGATGTCGTTATGTTGCCCAGAAAAGTTCCCTGGCAAACTAATTCTGAAGGTCTTGTAGCTT

CCATAAGGAGTAGTGTTATTAATGTGGTTAATGGGCATAGATTCACAGAATGGAATAGCCAGTGTATGGACAGTAGCGTATACCTTGGCAGTCACATTAT 

CTTCACATAGAGCATTAATAAGAGCACTCTTCTTGCTCTCACAATAAGGCTTCACAGACTTGTCGAAAGAGGACCAGAAACTCTTATCATAATCTACCTC 

CACCATTGAAGATTCGTATGTAGTGGAACCCATAGAAAAGATAATAAGATTGTCCACATCGACAAGTACTGAGTAGTACTGGCTGTTTTGCAGATCAAGT

CCCTGATCCTCCTCATCTTCATCCTCATCCTTCTTCTTTGTAGACTCGATGTGCACACAATCCTCAGCAGCCTCAGTTCCGGCAGGGAACAACTTAAGAG 

ACTTGAGGATAGTGGACATGTTGAATGAGTTGGTGATGCATCCAGAGGAGTGTCTGCATAGAGATAGACAGACAGGAGTTGATCTGTTCTTTGATAGGAG 

AGAAACATCCATCTCTTCCTCAGATTGAGGATATTTGGTCACTGAAGCAGATGGTCTGAAGCTGTTGTCTGAAGCCAGGTAGAGAGGGTTCCCATCGCTC 

TCGTCGAATTCGCACTCTGTTACATCCCCAAACACAACTTCTGAACACTGGGAAGTTGCACCATCATCATCCTCATCTTCTTCTGAATCTTCACTGCTAC 

TGCTGATAACACTTTCCTTGTCATCTTCATCCTCTACTACTGTTGCTGTATCCTTATCCTTATCCTCCTCATCCTCTTCCTCTTCTTCTTCAGCAGCAGC 

TACAACAGTGACAGATGAAGACGAAGAGGAAGAAGAACACACTGATGAGGCGTCGTCATCATCATCATCGGAGTCTTCTTCTTCTTCCTCTGACTGCTCA 

CCCTCTTCCTTTTCCTCCTCTACATCACTGAGACCGATAGTTTTTCTAATAAGATTCTTGATGACATTTCCAACTGAACTTTCAATAGTTTCATTGTTGT 

AATCGTAATCAACTTCCTCCTCGTTTTCATCTTCGTCTTCATCCTCACTGTCGCTGTATGCATATGCTGACTTTTCGTCATCCTCTTCTCCAAATACAGT 

ATTGTTCAGTGACTTGAGGTTACTCTCCCTCTTGGCCAACCACTTGTCGTTCTTGAAGCAACAACATCCATTTCTCATAGATTCAGTGACACTTTCTTCC 

ACAATTTGGTTAGTGGTCTTGAAATGTGTTCCTGCCTTCTTGGTAGACGAAGAACTGGGAGTTGACATGACCAATCCAGTTGACTTGATTGCACTATCAA 

TATAGTTTCCTCCAAGTTCCTTAAACAGTGACTTATCTCCATTACTTCCAAATATCTGCCTCTTCTTCTCCAAATCCATCTCGTCCACATATACAGCATC 

AGAGAGCTTCAATGTGAAGAGAGGGCCTCCCTTATCAAGGGCCTTCTTGTTGAGTGTTTCAATAGTCACGTGCTCATGGAACTGGAGATTCATACAACCA 

AGGAATAGATTGGCCAGTTTTGTAGTATCAGTGGCGATTGCAGTTTTCTTCACAGCTTCTCCCTCAACCTTGACAATGTTGCTAACCAGATCCTTTCCTT 

CAAATGCCGACTTGACCTTTGAGACACACCCATCACAAAAGGTGGAAGGGGACAGTTCTGTGCGTCCAGAGTTCATCACACTATTGAACAGTACCGATTT 

CTGCGCCTTCTTATTCTTCTCCAATTCTTCGTCCTCATCTTCGTCATTCTCCAGAAATTCAGCAACAGATTTGGCCAGAACACTCACGTCAAGATGAGCT 

GGCACCTTGCCCATAAGAGCGTCCAGTTCTTGAAGAGATTGCTGCTTCTTCTCGGCGGGTGTGATTTCTGGAGGAGTCTGCTGCTGCTGTTCTTCTTCCT 

CTCCTACAACTTCTTCCACAGTTTCCGTGTGTGGACTCCTGATAGTGTTCCTCCTCTGCTTAGAGACCATTCTTCTTCCAGCAGCAAACTTGGCCTTGGA 

AGGAGTTGGTTTCTTGGCAGAAGTAGTAGTAGTATCCTCAAATTCATAAACTTGATCTGGTTCCTCAGAGAAAGTGACAGTTGGCTGCTGCTGCTGCTGT 

TCCTTGATTGACTCCATTTTTAGAATATTGGGTTATTAGGATTCAGTAGTTCTGTTATTCCTAGACAACACGTCTTGTCTGTACGATTAGTCGAGATGAT 

GTGATGTAGAGCCCTTGCCCGTGCCTCAATATATACCCCTCAGTAGACCTTGATAATCACGGTTGCCAAACAACAACGGTTGGAAATCTTAGTAGAAAGA 

ACCAATGGAACAAGTCAGGATAATAATATACAGTATAAGCTTTTGGGGAACAACATTCATTTATTGTCATAGCGCTAACATTAGAAGTACAAGAACAGGT 

GATTCTGCATCCAGCTCATTTTGTTCAACCCACTTCATTTGCAGGGGACAACACTCTTCTTTCTTGAAAAATATGCATGAAGACACTTGGAATTCTTCAA 

TAG<5CCCTGGGGCGTTGCTGGAGGAGGACGCAGAAGAAGAAGAAGAACAGCTGCCGTTGAACGAGTTTATAGTTTCCAGAAGACAACATAATTTAGCTGT 

AATTGGACGAGGAGACTTCTTGTAATGAGAGAACCACAAAATTCTTCTTCCCGTCTTTTCTGGTATAAAAGTAGTCCCATAATATGTTCCTTTCTGGACT 

ACCACATTATCTTCGTTTAAGGGAACATATCTGACCGACTTTCTCATCAAGGGCTGATCTCCATTTCTTGTTGTGCTTAGAGCCGATTTGTATTCCCCGT 
CGTTCACGTTATAGTTGCATATAGAACAGTGGGATGAAAGGAACTTGTCGGTTACGTGTTCAGTTTGTACAAGATAACTGGACGCATTTGCTAGTTCTGG

AGGGACGTCCTGGTTTTTGTTTATTGTTGMGTAGGTTTTATCTTTTTCCTATTTAAGCACACACCCTTTGAATTAATGTGTATGCGATTCGCATTGATG 

TATATGCAGATTGTGAGTGCTGGAGAAAATGGTATCTCAAAATCGGTAGCrGGTTCAATAATTTCTGGTTCAACTTCCTGTTGTTCTTCTTCCTCATCTT 

TCTCTTCCTTATTTTTGACACCTAGAACTCTAAGGTTGGCAGTACTAATTATTCTGGCTGCAGCCGTGCTGGTAGCAGAAGGGGAAGATTGTTGGGAGAT 

AACAGGAGACATGTTGACCGTGCCGTGTCGTGAGGATAAATGGCTAAAGGTATATACGGCATGCCCTTTTTTCCGTTACTGTGTGCGAGGTCGTTTTCTA 

GGACCGTAGATGACCATCTGCCTTCCTOTCTTTTTCTGGACAACCTTTAATTGCCATGGTGTTATGAGAGGTAAACTCGCCTCATAGAATCCGCAAACTT 

CCTCTAAAGGGAACAAGTGTTTCCCC^TAGGGGACAATTCAAAAATATAATTCCAACATATTAAGGGTGTTGTTTTATTTTCCTTTTCATCGTACTCCGT 

TATTGTTTTACTAAAATTTGGAAGGTGATTTTTTGATAGACTATCGGACATTATATCCGCTGCAAGTTGACGTTTATGTTCAATGACATAATCATTCGAA 

TCAAAGTTGTCGrGTGTTGACACCATACCATTCCATTTCCATTTGGCTAGTTTGGAGAGGGATTTATTGTTGCGCAATTTTAGACGTGCAGAAACGTTTT 

GCCATATTATTTTAGAAAGTACAGGCAAAACAACATCCCTCACGTCAATACTGTCCATTAATATATCAGTAGAATCATTGAACCTTATAACATCTCTCTG 

TATATCTTCGTTTGATGGGGCCAATCGAACTAGAGGAACTAAATCCCCAGTTATGAGATCGGTGAAAATAAAGCTCTCGTCATCAACAATTTrTTTGGAG 

CCTATGTACATCrCACACTTACTATAAAAGTX^TTTTTATACCCTGGTTTCTTTTCCCACCTAGAACATAAGAGTATTAAAACTCTCGACAAATTATACA 

TGGCACCTACTCTTCCTTCCATATTACTGATAGCTCTATACATATTAACCACGTATTGTATAGGTGCTTCTACGTACAAATCTGAAAATCCACTAGCGGC 

AGTTCCTTCACTAAAGGCTAGAACTTGTTCCATAGTCTTCTCTTCTGAAAACATTTGCTGAATT^ 

GTGCCAGGGTACTGGGACCCGAATTCGTTGAAATGTTGTTGTTGTTTT^CCCTACTTCTTTAGTTT^ 

CCTCCAGGAACAAATCACAGATGGCTCGTACTCCATCACATGTGCATTTTACAGTTTCAGGGAAATTTAATCCTGTTATTTTATAAAACAACTCTCTGTC 
TATTATTTTTCCCATTCGTAGGCGATTATTGACATTAAACCCACAACACTCGCCTCTGATTCCCATTATAAAGGCTAAAGAATATACTATAATATTTCCC 
GCACTTGAACCCACGAGTCTAAGAGTTGTATATTCTCCATCGCAGTTATAGAGACATCTTGCCAATTCGTCCATTGATTTCCTGCCCGTGAAGCGCACAT 
CCATGAGTCTTGCTCCTCCATATCCAATTTCAGCGTTTATTGAAGTCCTCCAACCTTTATCCTTGTTTAGGCGCTGTATCTTTTTCGCCTTCTCTGCCAA 
TACATTCATAAAAGTGTAAAATTTAACGTCTCTTACAGATTCCATAATTTTTTCCTCAAGTGTTAGCGGCGGCGGTGGCGGCGATCCTTCTTrGTAGCCT 
TCAAAGATAAAAGAGTGTTATTCAACTTACAGTGTTTATTGGCCCAGCAATTCTTTTCTTTCATCAACTGCATAGTAGAAGGTCGATCGGAAAGAATGAC 
CAATTTGGAATCCAAATCCCTAGGTTTATCTAGTCCAGCTTCTATGTCCATACTCTTTCCTCCTATTATCATATAGTCTTCAAACAGTGTTTTATTAGCA 
GACATGTTAATCATTTCAATTACAGTCTGGAGGCATGTTTCATGTCTATTGACCAGATCTTCATCATCTTCATCCCATGTCACACCCACCAGAATATTTT 
CTCCTTTCrTGAGTTTGGATACATTTTCTCTAATCTTTTTCATTATCCGTTGTTTCTTGGACGCCACACTTTCGTAGCATAGCGTCGGTAGGGGAGAGTC 
GCAATCCACAACCAAGTATTTTTTCATGATTAATTTGTTGAGGATTGTGTTCTTTGAATTGAAGAAAAAAATCATTCGTTCTGTACCATGGAGACAACTA 
TGGATAATGTCGrTCAGAATAACGACGTAACAAAACCAACACCAGATGTTGCTACTGTTACAACAGCAACTGAGAAACGTCAGTCATGCAAAGAGAAAAA 
GGATCAACTTAAGGCCGAATGTCCTCAAGTACTGAGAGCACTAAAATTGTCCAATACTTTAAAGGCAAATTTTGGAAAATCCATGTCGGCTATTTTTGCT 
CAACATTTAGTGGACATGACAAACGCTAAACACTTTAAGGACCCAAAGACAAAGAAGATTTTAGAACTGGATGGAAGTAGTAGCAGTGACAGTGAAGAAG 
AGGAAGAAACTAGTTCTTCATCCAAACGGAAAAGAGGTAGTGGTGCTAGAAGTGCTTCTTCAAAGAAAGAAAAATGCCCCAATACTATCAAAAATTGGCT
CAATGATGCTCAAGGTGTATTCCGCCAGTTTCCAGATATCATCATTAATCTTCCCTCTTTTGATGATCTTAGAGACGAAGTAAAGGATGAACAAACTGAG 
CTAAAGACCATATATGACTTGTATAGACAGGACATGGAAAAGGTGGTGGAAGAAGTTTTAGGGCGCCAAGACCTGTTTGATCACAAGTCAGAAATAGCCA 
AAGGTTTGGCCCGTTTCGATACCCACGTCTCGTTGCTCCCTTCGGATAGGTCGGCTGTTCTAGACTCGTCCATATCCAAAGAGTTGGAAAAAAATAGCAA
GGGCCCGAACAGTAATATTTTTGACACACTAAACACACTCAAGGAAGAAATCAAAGAACTTTT^ 

CCGGAGGATGCAAATTTTGTGTTCAATAGTTCTGTAAAGTATGTAAAGAAATCATATCAATACTACATACAAACATCAGAGATGGAAAGTGATGAATTTA 
AGTCCCTTCTCACTGGAGTCAACATTAAAATATTGGAGAAGATAATCTCGTCCGATAATAATGTTGCTACTCCTTACAAACACATCACTAATCCCAGGAA 
CATTATTTCGTCTTTACAAAAAGTACGTGAAACTAAACCTGTTTCAAAGGATrATCCGTTCAGAGTGGATACGGCCAGAGATATTGTACTACTTCCAGAG 
ACGGGTGGCATTTCTGATCTCCCTATCAAGCCCGTTACATTATTGCAGTTGGTGTCCTACATCAACGCTCTCTTTTCCCTCGAGCGCCGAAATGTTTTCA 
CCGACGGCTTTTTTAATGCAGCGTGCGTCCTAATTTCCCAGTGCCTAACGAATGCGAACCTATTATC^ 

TAATGTTACrrCGCCATAATCTTCTGAGTATGAAAATGCTTCAAGAAGGTTCATCCAGTGAAAAGAAGAGTAAAAAGAAGGAGAAAAAGAAGGATAAAAAG 
AAGGGCGGTG^TGGTGGTGACGATTCTGATTCAGAAACAGATTCTTCCTC^TCATCATCATCTTCTTCTTCTTCATCCTCCTCCTCCTCATCCGAAGACG 
AAGAAGAAGAAAAAGGAGAAG<:AGTAGAAAAAGGCAAGAAAACTAAACGCAAAACAAAAAAGAAACCATCAAAGGACGACGATTTAGATACAAT 
ACTGATTCTAAAAACAGGAGGTTACTTCCACGACACGAGTGAACTCGGCAATAAAATTAGAAATTTAATAGACAAG^ 

TATGCAGTAACAATCACTGAGATGCAATCTACGCCAATGAATCAAAGATTAGTATCTAGTCTTTTAGATTTGATAATGAGACTAAAAGAACAAGTAAA 

ATAGTGTTGATACTGAAAGTACTTCCTCCACCGCCAAATCTAATAATGCTTTAGATAGTGCTAAATTGACATCTCAACAAGTGGTTACAATGATGGTGGA 

TTCTGGAGCTGAATTGGCAAGGCTAGCTGCCTTCTTTTTCGTGGTGGTGGATAATACTGTAGTTAATCGCC^TGA^ 

TTACCCTCGAATGAAAATAGAGGACTTAAAACTGTTGTAGAGTCATTCTTTAAAAATTTAACAATTAGTAACAAAGTCTCTACCTCGAATGA 

TGTCGGTGATGCCGTTCGAAGACGAACAACAACAACAACAATGCCCTCAACATGAACAACAACCGGATTTGAAAAGAGTGGTGGGAGAAGTATTXCTAGA 

AATGGGAAAATCAATAGTGAACTCATTCCCTTCCAATAAGAGTGTACAATTAACGGCTGACGCGTTCAAGCAAAACTACTCGCCTATGGGAAGACGCATA 

AATTTGGCGGCCAAGATAAAAACGGCTATATCTATCGGATCTAATATCTCGCCCAATATACTATTTTCTAACCrrCCCAGAATCTGTAGGGAATAATACTG 

TAACTGGTCTAAGGCTAACCAACCTATTGAAAAACATATCCCAAAGTGCCC^GCTAATAATATAATCAAAAATGCCAACACGCTCGTAAACAATACGAT 

GGACCAACAAAACTCAGCAGCCATGTCCATACTACTCTTTCCTCCAACATCAAAAGAAACTTCTATCTTTCCCGGAAACGATCCTTCATCTATAAAATTA 

CAGGATATGACTACAATGTCAAATCTGGCACGAG<3ATTTTATTCCATCGCAGAAGGATGTATCGGGGTCGTACGCTCGAGGGAATTTGATGAAGGAGGAG 

TTAAAGCGTACACTTTACTAGTGGACTCAAATACTATGGACATGGCAGTTAATTTTGCCGCTCAGTCTCTGGAAAAATCAATGTCTGAAGCCTTAACAAA 

TAATGCGAATATGAACCCTTCTAATGTATTAGAAGGAGGATCTTTTGTAGACGGTGCTCTTTCTTACATGTTTGAAAAAAATGGCAGTGACTGTGAACCC 

ACCCCTCTAGCAAAATATACAATGAAAGATGTATCAAATCGATACTTGAAAAAATTCAACAATGATAAAAATACACAAGATTTATATAAAAATAGA 

AAAGAGCACTCGTCGAACAGGTTACAAATAAGCCTACAAGTGTTGTGCATTCCCAGCTAGCAAACGCCATGGGGGTAGCTGTTATAGGCGCAGCATCAAT

CAAGCTCATGGAAGCAGAAGCAGCGGAATCTGAAATGAGAGCAGCAAATTATCAAGCAACATCAAAATCTACAAATGCTATTAATATAACAAATACAATT 

GGAATGATACGTAATACTACTCACCTGTGTACGACCATCGCCGTGAGTGCAGCCGCCGACATGTCAAAGCTCGCCAACAACCATTTTATGAGTGTATTAA 

ACACTGCAAATAACAGTCATAGCAGCAGACGAGGAGACAGATCGTCCCTGATGTTGCAGCAACAACAACCAACACATTCAGCATTTCTGGAACAGACAAG 

AGGAAGAGGAGGAGGAGTATTAGGATCAGGAACTGAACAGACAAAGGATCATGTGGAACGTATGAAGAGAGATTGGATATTAAACATGATATCTCCAGAA 

GACAAAAATACTACTACTACTACACCCAGTAATGCCGGCCGGACATTAGGATATGGATCTAATATAACTGGCATAAACACGATAAAACAAGACGACAAGA 

GTATGATGGATAAACTTTCTGAGATGTCTAGCTTCAGAACTTAAAGTGCCAATTTTCATATAAAAATGTAGTATTATGTGGTCGTTTTTAAACTTTAAGC 

GTTTGTGGTTTTGGGTCGTTCACGGTCCAAGATGTAAGAGTCCCCTTGTACCCAAATTATAACAAAGTGTCAGTATGGTCAATGGAAAGTATCTAAAAAT 

CCACCCGTCTATCATTCAATATCCCAACTGAACCAACCAACCCCAGAACCAACAACATGTCTCTTGTTGAGAATAATACCCAAGAAGAAATGATTCTGGA 

AACTACTGTTGAAGGTGTTGTGGAGGGAGCAGAAGTTGCCCCTAGAGGTGTCAAGAGACCCCTTCCTTCTTCCTCATCTTCTTCTTCAGCTTCGGATAGT 

GAAGATGATGAAGGAGGAGAGCAACCACAAACGAAACCCCCAAAGAAGAAACGTAATATCAATAGCGGAAAGTACTGGAAAATCGAAACAATCGAACCAG 

CATCTCCAGAAATGCTCAGTGCTGTAAATGATAT^rGATAACGTGTCCAAAACTATTCCTCTAATCGATAATAGTTTTGGCGTTCAGTTTAAAAAGAGTGT 

ATCTGAAGAACAGATCAAAACGCTACTCACTGAAACTATTGCCGTGGAATATGGAACAATCACAAATGTTAAATATTCAACCTTTAATCAACTCGAAAGG 

ACCGGAGAGCCTCTTAAGAAGAAGCGCTCAAACAATGGAAATAACAACTACAGATACTGGCAAATTCGAATTGAAGCCGCTGCAGCAGAGAATGTTACAC 

AAGCTGTTTTGGACGC^VATCGTTGAAGGAAATGACACCGTGATCAAGGCAATCCTCCTCCCTGAAGGGGAGGGAATCGGGCTTCAATTTAACAAGAGTGT 

TAGCTCTCAACAAGCTAAAAATATTGTCCAGGCTGCCGATATTGAATTTGGACAAGTTGCGCACATGAAGTGTAATTTGTTCCACAAGATGGAGAAGGCC 

GATGAATCTTCTAATTCTTCTGGTGAATCACCAAAGGTTAAGAAAGTAAGGAGGAACAAGTCCCAGCCTACAAATTCTTACTACACTTTCACTATGATCG 

GGGATTCTCTTCAAGAGCGTATTGATAACGCAATCAAAGTTATCGAAATGTCTCCAGTTAAGAGGCCATTCTCCAATTCTGCTGCTGCTGCTGAAGAAGA 

CACCACTACTACGACCACTACTTCCACTGGTGTTGTTAATCCACGAGGGATTAAGGATATTCACTTTTTCGATTCATCCATTTCTAAGGGATGTTTCACT 

GTGAGGAATATTGTTGCAGCAAATGGTGAAGTTCCACAAGAAGAATTCGTTTCTGAGTTGTACACTAACCTTCTAAAGGTTGAAGAGAAGGTGGATCATC 

CTACTTTCAAGAAGCrTATTCATGACCGTACAATGAATCGACACATTAAGGCATGGTACTGCATCTGCCCCTACTACACCACAGGAGGCGTCCCCCCTGC 

AGCTGATAAGGTTTCCGCTAAGGGAATTGCTACCTACAGAATATACGAGGACAGGACTGGAGTGTTCCAATTCGATGGGGCACATACATCTACTACCCCA 

GCACAGGCAGCCGAGGCAACTGGTGCTATTCACAAGTCTATGCTCTTCCAGAGTCCTGGAACTGATATTCAAAAGTTCCTCGATGCTAAGAAGGCAGAAG 
GTTTGGAGCCTATTTCATCGGGCGAAATTGTGTACCGTTCAAAGTGGAGCCCCAATGATAGCAGGGCAACACGCTGCTTCAAATTTTATTCCTCTTCAGA
CGAGAAGATGAACATTGCTGATGTTTTGTCTATTGTTCATACTGATGGACTTTTCAGTAGTGTACACTTTAGAAAGGATACTATGGAATATGGTGTTGCA 
AAGAGCAAGAGTAAAATTATCCCCAAAACTATCAAGATTAAGAAGGGAGGAGATACTTTCCATAGCGAGGAGGATATTGAAGTGCCTGTiW^ATTCACTG 
CAATCACTTCGGAGGAACTTAATAGGGAATGTAACACCAAGGGAATGAACAGCCTCCGAGCACACAAGAAACGAAAGAGCAACTCTTCTACTACTACTAC 
TTCCACCACTTCCACTTCAACAACAGCAAATACTCCAAAGAAGACGAAGAAGAGCGCTTCTGCTGCCAGTGACCCATTTGCAAAGCTTACTTTAGATTAT 
GTTGATAGTACATCATTTGTATTTTACAATATCAGTAAGGAAATGGTGCAGAGAATTTTGGCTCAGGAGAGAGTAAAGACGCTAAAGGCAGTCAAGAACG 
AAGAGAAGATGGAAATTGTAGAGGGAGAAGAAGCACAAGAAACTTATAGAGGAATCGTAAAGATCAAGACGAATGCAAAGGCATACAATCTTGCCAACAA 
AACATCTGGAGTACTTTTCCCTGCTGATAAGGTGTGTCTAAAGCACACATTGGAAGATTTGGGGGATGTACTTGATTTTGATGTTGTAAGGGAGGATAAT
GTTAATAAGACGGTCGCTTCTACTACTACTACATCATCTGAAAATAAGGCAAGCGGAGGAGATGATGAAGAAACTCCAATGGAATTTGAAACTGACGGAG 
AGAAGCTGTTGCACGAATTGTTGAATGAATAATGTTGTTGTGTTGTATTTTGTATATAGAGAAAAGAATAAAATAAAAGGTTAATATAATGTTGTCTTTT 
ACATTTTCCTAAACTTGATATAATAAACTATATAGCAAATATTAATGCTATTGACACATGTTACTAGTCCCTAGTCAACGTGTAATTGTACATAGTAACA 
TGCCTTTTCTTAGGCATTTGAGGCAAATAGGAAAAGAGTATTTACGATGGGTGACCGTCGTGATAACGCCTACACCAGGTGACAGTATAGAGGGTTTCAG 
CAGACTGCAAATACTCACCTAACAGGCCTTGGTATAGAGGAATCAATAAAAAATCAACAAAAACCTTAAAAGTTTGGTGTTTCATTTTTTTACATCGGGG 
AC AG AAATGGG<STT AAAAAC AAAT ATCCG AGCTGCAATCT GAC ATTGGC TCC CT ACCC AGAGGT CC ACCCTCCG AACTTG AC AT TC AGTC G ACCC AGCGG 
TCCACCCTCTAAACTCGATCGAGCTGAAAAATTTTTTGAAAAGTTTTTGAGATGGAGAATGAGTAAAATTCTCTAGCGGAAACAGTAGGTTACACACTAC 
CCTTTCTGGGTACGTTCATATCCAGAAACGCCTGTTCCAGAAACAGTAAAGAGTTAATGTATGTTTCTGGAGCAACAGACGTTTCTGGTGTAATCTGACA 
TCGAGTCGACCCAGAGGTCCACCCTCCGAACTTGACATTCGGTCGACCCAGCGGTCCACCCTCTAAACTCGATCGAGCTGAAAAATTTTTTGAAAAGTTT 
TTGAGGAGGAGAGAGGAGGTAAATTGATACAGATAGAGGTATCTTTAGTAGAGGTGGTGGTCCTTCTAGGAGCTGGTGTCTCCAGAAACGCCTGTTCCAG 
AAACAGTCAAAAGTTAATGCACATTTCTGGAGCTATCCCATTTCTGGTGCAATCTGACATTGGCCCTACCCAGCGGTCCACCCTCCGAACTTGACATTCG 
GTCGACCCAGCGGTCCACCCTCTAAACTCGAGTGACCCAGAAAAAATTTTATAAAATTTTGAGTCGGAGATAGAAGGGAAATTCTCTAGTGAATACAGTA 
GGTTACACACTACCCTTTCGGGGCGCAGCAAGTCTCCAGAAACGCCTGTTCCAGAAACAGTCAAAAGTTAATGTGCGTTTCTGGAGCAACAACCATTTCT
GGTGTAATCTGACATCGAGTCGACCCAGAGGTCCACCATCCGAACTTGACATTCGGTCGACCCAGCGGTCCACCCTCTAAACTCGAGTGACCCAGAAAAA 
TTTTTATAAAATTTTCGTCAAGGGAAAACAACACTAAGTTGCTTGTCAATTCATACACTAGTGCAGGTTGGTTGAGATAGAGAGTGAAGGAGGCTATCCA 
GATATCTGGATCAACCAGAATCCAGAAACGTTTCTATCCATTTCTGGAACAGCCATTTCTGGAAA 

GGTACCACTTCTGGCACATTCGTGCAATCTGACATTGGCCCTACCCAGCGGTCCACCCTTCGAACTTGACATCAGGCCGACCCAGCGGTCCACCCTCTAA
ACTCGAGTGAGCTGAAAAAATTTTTGAAAAGTTTTTGAGATGGAGAACGAGTAAAATTCTCTAGCGAAAACAGAAGGTTACACATACCCTTTCTGGTTAC 
GGTCATGTCCAGAAACGTCTGTTCCAGAAACAGTCAAAAGTTAATGTACGTTTCTGGAGCAACAGACGTTTCTGGTGCAATCTGACATTGGCCCGACCCA 
GCGGTCCACCCTTCGAACTTGACATCAGGCCGACCCAGCGGTCCACCCTCTAAACTCGAGTGAGCTGAAA 

CGAGTAAAATTCTCTAGCGAAAACAGAAGGTTACACCTACCCTTTCTGGGCGCAGCTATATCCAGAAACGTCTGTTCCAGAAACACACAAAGGTTGATAT 

ACGTTTCTGGAGCAACAGCGTGACAGTAGTTTCTACTAAGCCTTCACTTGTACACATTTCACTTAGTATAATTTGTCATCACATGGTCGACAAAAATTTT 

ATAAAAATTTTTCTGGGTCGCTCGAGTTTAGAGGGTGGACCGCTCGCTCGGTCTGATGTCAAGTTCGGAGGGTGGACCGCTGGGTCGACTCGATGTCAGA 

TTGCACCAGAAGTTGTTGTTGCTCCAGAAATGGTGGGGGAGTATTCAGAGTGTGTTTCTGGAACAGGTACACATACCCAACTGATCAGATAAAAATTGGG 

TGTATACATTTTCTGATCCGAGAAAGTAAACC^CTGAACGGTGTAAAACGTACTGAAACATTACATCACCCACAGTCGTCCTCCTCCCTTCTAAGAGAG 

AACATATCCCGTACCCAACTACCTCAAACAACAACAACAACAACAACAACCATGACATCCATTATGGACGATCTCTACATGAGCTGTTCTTCTTCTTCAT 

CGTGCTCAAGTTCATCTGAAGAGGAGAATGAGGTAGGAGTAGAAGGAGGAGGAGGCAGAATCGGACCAACAGAGGCCAAGAAAAAGATCCTCCGTAAACG 

AAAGAGATCTTCTGTTAAAAGTACATCATCTTCTTCTTCCTCATCATCATCATCAGACGATTCTGACTCTGATAGAGAAGAAAAAGAAGGAAGAAAACTA 

TACGTGGATATTGCAGATACAAGGAAACCGCCO^GTAAGAAAACTGGATACTCCTTCACAAACTTTAGAGAACGACCTCTACATGTCTAGTTCATC 

CTTCTTCCTCCTCATCTTCCGACTCATCATCGTCTTCTGGTGAAGAAGAAAGTGACGATGATGATGATGACGATTATGACCCAGATAATGTCCATGTTCT 

TGGTTGTAAGAAGGAAAAATCTCCCCAAGACATAGAAGCTGAAAAGGAAAAGGAGGAAGAGTATGAAGAAGAATTCAAGAGAATGGCATTACCTTCACGG 

ATAAACACATCTGTAGATGATTGTGTTATACCTGATCGGATTTTAACCCTCTTTTCTACACTTTTGAAGAAAAATAGTTTCCAGTTCTCTCAGCCAGTTT 

CTTTCCTCCGGTTGGTGATGAAGCAAGTAAACGAGGCCATGAATTCAGCATTCTCTTCCATGTTATCCAGTTCTGGTATGAGGTTGGTGGAGGACTCTTT 

GGGTGACACATCTAAAATTTCCTCCTTTATAACACCTCAAACGGATACTAGTAATTCATCTTCATCATCGACGTTCGTCAATAATTGTACAGATGAAGAT

ATCAAGAAGCGTAACATTGCAATGGGAAGAGTAGCAGAATTGCTGTCAAATATTGCAGCCAGTTCTAATGAGGAGAATAATTTCCGCCCTGTAGTTTCCT 

TAATGCGAGGACCAACTTGTGGAG^TTCTAATGCATCCAACAAGAAACTCAATAGTAACAGGCAAACTATTCCCCAAGTATTAAACAAGGTTATATTTTT 

TAGAGAAATACACAGTGTTATCGCGCTATATTTATCGTCCGTCTGTGTGCAGCGAGCAATGAATAATGACAACACAAATTCTAGCGGATACGCTGAAGGG 

ATGGTTACTAAAATCTTGAATATTATTGGTAAAATTCCTTATAATGAAATGAGTAGAGAAAAATTCATATCCGTTGGAAGAGATGCACTATATTTGTACC 

AGAATGTGATCACGGATATGACTGGCCCCAAACATAACAAGAGACTCCGTATCCCTCAACAACAAGCTGATTTTTGTTACATTATAGCCATGTTGGTTAA 

TGATGTTCCCATTACCTCAGATTTACTTTTAACTGGAAAGGCGACAAATTTAGTACAATTTGCTTCTGCTATGGTGGATCCTGCATATCGCCTGGCTGTC 

CATAAAATGGCGTCTGTTTTCAATAGTAGTTATTCCGrATATAAAGTCCTAGATCTTGACCATAAAATGTTATTAAGGGCTAATTTAATCCTATCTATTT 

TATCAGCTAGAAATAAGTGTCTCAGTGAAAGAAAACCrAGAACATTAACTCAGAGCGTGTATTTGTTCCTAAATCATCTTTTGCGGAACAAATTGAGGTC 

TAGTGGTTTGACCAGTGAAGAGAGTTCTCTAGGAACAGCCGTAAAATTGGTGTCACAGCAATTAATGTATGAGGGTGTGACTCGTCAAACCATCGAGGAC 

GGATGTAGCATGATTAGCGGAAACTTTGAGGATGAAGACGGTGTAACACTGAAATGTTTGGGAGCCGATGTTAAGGACGTGAAGACTGTTGGACTATCTG 

CATTGCTATCTGATAGGCTTAGAAAAAACATTAGAAGAAACGTCCCTTTCTACTGAAAAAATATGGAAACTACTACGCTCGCTTTCAT^GCAATGGGTTT

GCTTTTCGTGGTGTTGATGGTGTACATCTTTATACAGTACACGTCATCAAATAAAAACAAGACAGAGGAAAGAAAGGGTAGTAAATTATCATCATCATCA 

ACTTCTTCTACTTCTGGTCCAACATTTAAGAGAGAAGCGTCTCCTGAATTAAGAGAAGACATACCACTTTCTCCACGATCTTCTGGTCCAGAACTGAATC 

TAACTGACGCTGAATTTATAGACCCTGAACTCTTTTCTGAAGCCAGGGTCAGTCCCGACCGTTAACGGAAAAAAGGTGCAAACCGGCAACTCTTCTCCCT 

CCTTTCGAGTATAAGTAGAGGCGTTGCACCGAGCCAAGCATCATTCGGGCTCATTCAACTGTGCTGACAACAACTAAGAATATACATTGAAATAACCCAA 

ATAATCAACTTTTCAAGATGTTTGGAAGTTCTGCTAATAACTTTAATGGTGACAAGAAATCTTCCTCATCATCATCAGCTGCCGCATCATCTGACGATCA 

GCAGTTAGGTCCCCTTGGACTGTCTACTGCTGATTTCAAGAAAGTTGCTGCCATTCTTGCCAACAGAACAGAGTCCCTATATCTGTTGCCAGATTCTCCC 

AACTTTAAGAATGTGATTAACAACCCTAACCAGATATCTATTGTGCCGTTTCTGGGT^^ 

ACCAGGCTGAAAATTCTTCTAAAGGAGGAAGTGATGGAAAGAAATCATCGCAGCAGAACAAGTTTAAC^ 

CTTTAAACGTGTGGCTGAACTCATTGCGGATACTCCTCCCTCTAAAGATAATCCATTGAGGGATGACCCTGATGCTATTCCATCACGCAACCCATGGGTC 
AAATTGACTCAGAAGAATCTGGAATATCTTTTCTGGGAAGCAGTAACAATTGAAGTCTCCAATGATAGGAGTATCCGTAGTGGAAGATATCTTCAAGCCA 
GTGAAGTGGGGGAGAATCCATTCCTAATGACCATCAGTGTTGACATTAGAATCCTGCAAAGAATGGCACTTAATGTCGTGTGGTTCTTCAACAGATTCTT 
CCGCATGGTTTCTGGACTTGGAGTAGAAAACAGGGCCAATTCCACCTATGTAGCAACTAGCGATGCTATTGCCCAGATCTGGGTAGAGATGCTCCTCAAG 
AACTTTATTTCTGGAGAAAATGTGCCCCAGGCACTGAAGTATTTGAAGGAACATTATGAGCATGTTTATAACAAGATCAGTAAATGTGGACGTCAGCCAT 
CCTATTTTGTTGTTGAATTTGAAAGAGTGGACAACACGATTGGATTTGTCAATTCTGATACTGAACATAATGGATCATCATACATGGAATACAGGTGCTT
TGACACAATCAGGAAGAATGCATCATCTGGGCCAAGTGGAGGCGGAAAGAGTGGTGTCCTGTCTTCTGGAACATTCTTCATTGATAACGAAATGGGGAAT 
AACAATAGTAGTGCAGCTGCAGCTTCTGCCCCTGCTGTTTCTGCTGGAGTTTCTCCATCACTTTCTCCATTTAGCAGTGATGGAGATGATGATGACGACG 
ATTGTAGTGGTGATGACGTGTGGGGGAAGAAGATGATATTCAACACATCAGGAGACGGATCAGGAGAATCTTCTGGGCAGAACGGTGGTGGTGCATCAAC 
TTACAAGAGATTTAGGTGTGGAGAAAATACTGCTTCTCTTTCTCAGAAGGAAAATGTTCGTCTCATGGCGATGCCAAAGGGAAATGAAGATAAACAACTA 
CTCAAGAACATTATCAACTTTCTGAATAGTGCACTTAACTCTGTTGAAAACCATGTAATGTGTACAGATGAAAATATCTTCGATGAGGATCAAGCCGAGC 
ATTATACGTCAAACAAGGAACTATACAAGGCTATTGTTTGCTCCAACCCAGCCAATGTGTACAGAGTAATGGTTGAACTGTTTGTCAACCTTATTCTTCC 
CCGTCTTAGGAACCCAATTGTGAGTGACATTGAAACTGTACAAAATCTTCCTTCAAACAATGGATCGGTGAGAACAAAGAAGATGGTTGAACATGGATGT 
ACAGATATGCGCTATGACATCCCTCCATATGCAAAGGGAAAGATTCGTCTTAGTGCCAAGAGAGCTTGTGAGTGCAGAAAACTGTGCAAGGACGTGAGGT 
GCTTTGACAAGTCTAGAGAGGCAAATCTCACCCCTAGCCAGAAGGCAGGAAGGGAGGTGGAGGAACCTTTTCCCCGCAACCACAATTCTCACAGGAGTAA 
CGCTCACGACTTTACTTTCTATGACAAGTACAGGGCAAGGATGAACAAGCTCAAGAAGGATTCAAAGAAGAAGGTAAAGAAGATTGACACCTTTACAACA 
ACAGACGATTTTCTCCTGCAAGATAGGAACGCTTTTGATCTACTTAGAAAGTGCTTCCTTTCTGCCTCTCTTCATCACATTTTCTGTCCTGATGTGCTTA 



FIGURE 2, Sheet 21 








WO 01/38351 



PCT/US00/28888 



23/201 



TGGTGCATAGAGGAGACAGCTTCAATATTAACTTTGCAAACAACAAACTCGAGTGCTATAATGAACGTAACGGAATTGAAGAAGTTACTTCCTCACAAAC 
GGTCAACGCCAAGGAAGCACTTGAGGATATTACAAAAATTAAAATGAAGAGAGGGGATGATATTATAGATGTTGTGAAGAGTAAAGGACTTTCTTTGAGG 
GAATTCTCTAAGAAGGTTAGTAAGATTGTGAGAAGGTTTAATGAGATCACAAACCAACTCTGCAACAACTGCAACGTTAACTCTTCTAATGGAGATGTGG 
ATTTCCACGTCTTTACTTCTGTGTGTGTCTACATCCACAACATTATTCCTGTGCTCGAAGATATCTCCATTTTTGCAGAATTGGGTGAAGAATTGACCAA 
GCTTGTTAAGGAGTGTAGAGACGTGGCTGGAGAGGACAAGACATATGATGATATTATCCGCAATTACGAAATTACTGTAAAGTACTTTAAGCTCTTTAAT 
GCACTCGTTAAATTCTGTCACAGGAATTATAATGTGGCAGTAACCTCTGCCATTAACAGGAGAGGGTACATGTGCATGGTGAGCAACCTTGTCGGGTATT
ATTGTAAGCTGTCTGATAACGCTATCCAGTATCACGAATCACTATGCTCTTTGCACTCTAGCATCTCTTATGCAGACTATTATACGTCTCGCAATAACAA 
TTCTGAAGATGGAGGAGGAAACTCTTCTTCAGAAAAGAGCAATGCAGATGTAGCCAAGACTATGGCCTCTTTCTATGACCAGTTCGATAAGAGTGAAGAC 
AGCAAGAAAAATAAGAACAAAACTTCAAATGAGATCCTTATAAAAATGTTCCAAATGGATAGGGTTTTGGATGGCATGGATGATGATGATGATGAAGATA 
GTGATAGTAGTAGCAGTGAGAATGAAGAGGAGGAGGAAGAGGAGGAAATTGTAAAGAAACCAGCAAAGAAGAGGAAAGTGGAAGATGTTGATAGCAATAA 
GAAGACACTGCCyVAAGGAACCTGCCGTTAAGAAGGTGAAGCAGGAAGAAGATGTGGAGATGGAGGAAGTGAAGGAAGCAGCAGCA 

GAGGAACAGGAGGCGAAGGAGGAAGACGCTACTGAGTATGACGACGATACAGAAGAGGACGAGAAAGCAGTAGCATCTGATGAAGATGAAGATGATGAAG 
ATTCTAAAGCTATTTTCTAAATATCATGTGTATAAAATGTAGTATTTTAAAGGTATAAATAAACACAATATAAAGTTAAACATTGTTTTTTATTTGACAC 
ATGTACATCATCJ^TTTCACATGGTTATATTTCTTGTAGCTTGCTCGGCAGCAATAAGAGCAAGAGAAGAGTCATATAAAACAGATGTCTGTGAAGCAGC 
CATGATACCCCTCGATATATTATTAATAAGGGGAGTTGCTGCTATATTAACAGCATAATAAATCAGGTTCTTCCAATCCCTAGAAGCCCACACATTGGGC 
ATTTTTATATAAAAATCAACGTCTACTATTGACCTGGCAACAGCCACTATACCTAGCTCATCAATGTTAACAATCTTCCCCTGAACACCCATTACAATTT 
TTGATTTTTCTGAATTCTCTCGTATAGCAGCAATACGGCTAGCTTCATTATTTAACCATTCCAATTTTACACTTGAAACATAAGCTGGGGATTTTTCATC 
AGTTACTTGAGATACAATATTATTCCATTGAGATAAAAGTTCCTCCAATTGAGTTGTAGTCTTGTCCGCTTGCTCTTGGAAACTTTGCATGACGATAGCC 
GAATCTCTTGT/PGCAGTTTCGATGATGGAAATACAGTTGAGCCATTGCATGTCAGTAAGGATATGGCGCGGATTAACTGGAACAAACAGATTCTGTGCAG 
GAGAAAAGTTCAAGTCTTCTGGAATCTTGTTGAGGTTGATATTGTCCTTGGTGTGTTTCCATGGGATAGGTTTCTGTGCCTTTGAAGCAATAAAGTATGA 
TGGGATTCTACTGTATTTCTTTAGTGTTTCTGAAATGTTTGTCTTTATACGGTCCTTTCTTCCATCATCCCTAATGTAATTCAAAAGTTTAGATGTGAAT
AGAGAAGAGTCTCTAATTTTTTGCATTTGTTCATTGTGAACCTTTATATCCTCGGGAACAGTATTGTTATTATTAGCCTGCCTCTGTTGACGCTGCTGCT 
GTTGTTGTTGTTGAGTAGTACTTCCAATGGTATTTAATTTGTCCAATATTGAACGTAATTCTACAGGCACCCTTCTACCAGTTCCAAGAGCACCTCCACC 
GCCGATGGCCGCTGCAGCTTCTTCACTAGCTTCATCATTTATAAAAAGTCTCCCCGAAGAAGCATAGCGTAAAGATCCATACCCTTCATTTCCAGTATCA 
ATAGTAATCCCATTTTGTTCAGCTGTTATCCTTTCCAGTATTTCTTCAATGGCCTTTCTAGGATCTAGTGCAACATATAATCCATCGACCATAATATGAG 
ACACGTGCTCGCACACTTCTTTAATAATTTTCATCTCTTCGTTGCTGACAAGCATCATCATTCTAGATATAATAGTAGCAATAACATTCCTAAATTTGTT 
CTGAAGAAATAGTTCGATCTCTGTGAACGTAGGCTCGAAACGGGCAGGCACATTAACTGGAACAACCGCTTGTGTTTCTTGTTGTTGCTGCTGTTGTTGT 
TGTTGGATCGCTCTTCTTGTTCCACCACCTACTATATTATTACTGCCTCTGAGTTGGTCGGCTGCAATAGCAATGATGTTTTGTACACCTTTACCAGATT 
CTGCAATTACTTCCATTTTTGAGTCTTCGCCTGAAATTGGTACAGAAGGAATCTTGGGTGACAACCTTTCTAGAGTTGTGGGTAATCCAGTATCATACTG 
AAGAACTTTCCAAATAAATTCGAATCCAGTTCCTGCCACTGTGCCCACTTTACTGGGGAAAATATCGATCCTCCCTGTAGATAGAGGACTAACACATTCC 
TTATAGATCCTCATTGGGCTTAAATGTTGTGTCGATTTAGTTTTTGAGGCTTCATAAAGCACAGCAGACGAGAGAAGGGGTTCAAAGAATACAGTTTCCA 
TGCCCAATTCCTTTACCGAATATTTGACTTGTTTAGATGCATGAACAGGAAGATTTGTGGGCTGAGAAGTCGTACCAGGGAGGAATGCGCTAACATGTTC 
AATTTTCCCATCAACCGAAAAACTTGTCCGTACACCATCAACGAAT 

CCAATTTCCATATCCTTTATACCATCATCGCGTTCAGTATTCTTTACAACTTTAAAGrACGACACCAATTCACCTTTATTTTTCAATCCAAGGAATCCAT 

CTTCTGCGATATTCTGACCATATGTCACATTTCTTGTTAGAGATTTAGGTGGTACATAATTTAAGAACGATTTTGAAATGGTTTCAGAAGAAAATGGAGC

AGTGGCAGTTGTTTGCTGCTGTTGTTGTTTTTGTTCCTTAGAGGATATTAGAGGTACATTAATGAAAAGTTCCAGATCCTTATTCATCTTTAAAAACCGT 

CCTGTTGGTACAGGAAGATTAACTACCCTAATAGGGGTTTCTGATGAATATCTGGAGGAGGATGATGGAGAAGAAGCGACTGAAGTGGCAGACTTTAAGT 

TTTCGGCTACATTATCTCCATTCAAAATACTCTGAGGTGTATCGGTCAGTGTATAGTATCTGAGAGACGTATCATTTTCTACAATAAAGGGAGGATCTCT 

AGGGAAAGCAAG^ACACGCCTCCATTATCGGCTGCCGAATGAATAACTGCATAATGTGTAACAATCTTTGCAATTAGATCCATCACGGGCTTCTTGAAT 

AATAAAAATTTGTGTTGATCAGGATCGGCTGTATATTTTAGTAGCACTTCACGTACTATATAATCTGTTAGTTGTCTAGCCTCCTTTACAGCCCATACGC 

CCGATTTCAAAACGCAATCAAAAAATGGAAGAAGAACAACTTGAAGTACCTGCTGGTGCATGTACATCCAAGCG-GTCTCATCCACTTTCACTGGCTGTCC 

TCTCATAATATTTCTTGTCATTGTTTCTAAACACGACCGATAAAGACCCACCACGCAGTTAATGGGATCGTACCATCTGTACTGCCGTCCATAACTTCTT 

CTTCCGGCCAGTATAGGCTCCCCTGGGAATAGTATGGAATCGCAGTTAGATTCTCTTCCAATATTGTATATGGATGCAATCTCTCGAGCAATATCTCCTA 

CCGTAATAGCAGATCCGCCCGAGAAAAGTCCCACAACAGCATCAGAGACATTTGAGAATTCCCTCCCTTCTAGATTTACATTATTATAATATATCAGAGA 

TGCTATTATAGCAGCAGAAGATTCCCCAGAATCGTCTCTCAGAATTAATTTTAACGTAGGAATATCAACATCTGGTATAAAGGCTGTAGTATTATTTTGA 

CCGAATGTAAGAAACCTTCCAGAATCGTGTATTTCCGACACGTTACTTTTTCTGTCGACAACGAGTGTTTTATTATGTTCAGCGTCGTTAAGAGAGGTTA 

AAAAAAGGTGCATTTCTATCCAGTCTGTTTTCTCCGATATACAGTCCATGATTGAAGTGTATTC 

AATGGAAGTAGGTGCCCCTGAAGGGACAATGTTGGTATCGGTAGAAGTTTTTTGCCTCTT 

GAAGATACATCTCCTAAAGATTTTCCTTTAGACGATAAAATCGATTGTGCGATTTTTTCCTTTTGTTCGGTCGTCAATGAGGCGGTTGAGGAGGAGGAGG 
AGGCTGCTGTTGCTGTTGCTGTTGGTTCATAAAGTTTGGGTTGAGGTTTATTTTTAAATGATAGTTGATAATACCGTGTAACCAAATCCTTTAAAGAATT 
GGTATTTTTTCCAAAAAAGCACTGGGCCGTAAAGTAGTGCATACCAAATTCACTCGAGTTAAATTTTGACAACTCTGGTATGATAATGGAAGAAACTGAC
TTGGAGTTTGAAGCCATGTCATTTAAATCACGTAGCCTTGTAAGAGAGG^ATGTTTGATAGAG 

ATGCTATCTGGAGGGGGGTGTTTATCCACACCTTCACATCCATTAGACGAAATATCACAGGGTGTTGTAATAATTTCCCTGTGAATCTAGAGGTCGTAAG 
AACAATATCAAGTATGCCGTGATGGGCATAAAACACACCATAGGCGTCTTTCAATACTATTGAGGAAGAAAGTCTTTCCATTTCCCATTCTTCTTGATTC 
AGATGGGATATATTGACAAGTTCTTCCATGCCGTCGTTTCCTTTAAGATCTAGCCGTATTTCTTTCTCATCATGCATTGTCCAGTCCTTAATATTAATGA 
CAACAGGCAAAAGCCCTGCCAATCTCTCTAAAGACGAAGCAGTTTCTTCCATGTTCAGCTTGATGT^ 

ACGAGAATAACATCCTCCTCCTCCTCGAGGAAATTTTTGTTCTGTCAGAGATGAAAATCGTCGTGCTTCTCGTGCTCTTTrTTCCACTAATGCATTTAGA 
GATTTTGTAGATATTTCTTCAATAGTCATCTCTTTATTCGTCTCTATAATAAACACAGACGGGTGTATTTTTACCCTTATTTTTGCATTTTCTGTTTCAC 
TTCCTAAACGCCCCTCGGTGAGGTTTGCAAGGGTAGAATCATAACACGCAAAAGTTCCCGCATTAAC^ 
TTGTTGTTGTTGTTGCTGTAGCCCTTCAGCCAATAGXsCAAGCTAAACGGCTATTGTCTTCG 

TGCGTTGTTGACATAATTTCCTCCTCGACGTTTCTCGTCAGCCTGTAATTCGAGTAGAGTTTGTTTCCTTATACGGGTGTGAACCAATAGTCGGTAAAAC 
ACCAATCCACCGGTTACGATGATGGATATCAACAATATATAGAATAGGGCCTTGTCACTAGCCTTGCAGCATTGTCTCTTTCTGCAGCTGCAGTCAAATC 
TTTGTAGTCTGGGCCTAATTCGCTCTGTACGTG^TTCCGTATTCTCTCAGACACGTCCATGTTGTTGGGTGGATAAAGTATAAATACTCAGGTGTACATA 
AAAACCTCTCACATGATGGCAGGAATCGGTAGAAGAGATAATAGACCCGTATTAC^TCTGGACATTGATCCAAATAAGGAAATACCCTATAACGTTCCAC 
CAACCCCTATTATCTGTGAAAAAAACCCCTTCGTGTTTAACATGCAGAAGTGCTCAGACTGTGCTCCTTTCCCTCCCTACCCCGGCACTGAGAAGCCTTT 
CCCTCCATACCCTGGTACTGCAGTAGAAGAGGAGGAGAAGCAAAAGGAAATTGAGGAGCTTCTGGTTGACCAATCTTTCCCTCCCCCATTCCCTGGAAAT
AAGCTGAGAGATATCCCCAGAACCTACCCTCTCGAATTTCCCGAGAAGAAGGAGAAGGATTTCCCTTGCGTTGACACTACCGGTCACAGCGATATCCCCT 
TCATCGATCTGGAGAAAACCCCACCCCGTAGTGACGTTAGGCACGGTTACCACTACTTAATCAACCCCAACAAGGTTGGGGAGCTTAACCATATCGTTGG
TAAGCTCACTGAAAAGCAAGAAAACCTGAACAAATTGGTGTTGGATGTTGATGACGTTGTGATTAATCTGTCAAGCACTTTGAAGGAACTTGAGAAGCTG 
CGAGCTGGCCTGTGCAAGTTCTCAAAAAACTAGACGGTAAAGGTATGACCAAACCGGTATAACCAACTTTTTATATAAGCCTAATATTATAAATCAACAA 
CTAACAAACGCGAAAACTAACAATAGGAACCATGGCCTCATCATCATCATCCCCTGTTGCCCTCTCCTCTGTCGCATCATCCGTGATGATGGAAAGAGAT 
GAAGAAAATACACTGTCCCTCAGGAACAGGAATGTGAACAAACCAACACCTGTTAGCGCCGCCTGGGTGCCTGTTGATGAAGAGGATGAAGATAGGGAGG 
AAATGAGAAGACTTGAAGATTTTTCTTCAGATGAAGAAGACGATGATAATAAATCATGTCATTGTGACCATAGCGATGACGATGACGATGACGAGGAGGA 
TCCTTCATGCTTTAAG^GTTTTTCAGCTGGCCTGTGCTCTTTTGTGAGGGGTTTCTTTGGCTTCCTCAGGAAGTCACTTACCAAGAAACAGGTGTTCCTT 
CTTACAAG<:GCAGCCGTTGCTGCTATTTTTAAGACTAGAGATGTGGCTAAAACTGAAGAAGGCGCAGCAACCATGGAAGAAAATTCAACAGATGTGATTA 
CTGGAGGAGATGGAGATAGTGGTATTGCTGCTGATGTTGTCTCTCTCGCTAGTGAGGGAGAGGGAGAAAATGGATCTCTTTTGGAATCTATTGCAACAAC 
ACTCATCAAGACTACAATTGAAAATCTTGTAGATGGTGGAGAAGAAACCACAGAATTGTAAAATGTTTTTTGGTATACAAATAAAAGAATACATAATTTA 
TATAATATTTTATTGTTTCCTCTTAAATGACAATTTCCTTATGATACTTTTAGGTGAAGGAATAAAGGTAGGAGAAGGTGTGGAAGACACAATTGTAGTG
TTTCCATACACATGATTCTCCCACATGTTATTATTGTTAGGTGGGAATTTCTTAAACGTAACTTGGGTTATRCGTGGAGCATTGTTGTTGTTGTTACTAC

TGCTCCTGGTTAGGGTGTTTTTCTTGTGTTTCATCTCCTTCAATTCCTGGATACACTTCTGGATAGATGGAGATrGTTGAGGTACTTCTGGCACGTATT^ 

GTAGACATGTTCTTCCACATTATCATCCTCTCCCTCCTCATTCTTGATATTGAAGATGGGAGAAGGTGGGTTTGAAGGAAGCTTCAGATGTCGTCCGTTT 

ACATATGACACGTAATCACCGCCATTGCTTCTAGGTGGAGATGGCGGCGGTGTTGGCAGAGGAAGACTATTGTCAGAAGAAAATGACGACGAGCAGCTCG 

ACATTGGCATATAGACACTTTCTTCCTCTTTATTTGTACTTGTTGGTTGTTGTTGAGGTGTATTGGCAACTGGCCTGGGTGTAGGCGGACTAGCATAGGT 

TGGACCAGAAAATGTTACCAGTTTTGTGACTTGAGGGGTAACATTGGCCAGATCGTAAGTGGTAGATTTGAGCATGGGTTCAGAGATGTCCAGGTCCTTA 

TCTGGAGTGTTGAACCGGACGTGGCTGATGAAACTGGATTCGGTGTTGCTCCCGTTCCTGTTCTTCTTATTCTrATTCTTCTCCCACGGTGAGCAAGAAA 

TGCAAGTAGTAGTTCTGTCGATGCACTTCACGAGCATGTTCATGGCCAGTGCGAGCCCAATAGTCATGGAGCAGCCGACTGCCAATCCACTCAGGAAAGT 

GACGAGGATAATGTTGTCGTTGGTGATAAGATTGTCCATCTCGAGAGATGTTTCTTCCTCGACTACCAGACGAGATATGATGCTCTGGGGCCCTGCTTCG 

TCCTTTTATACCGTCACTTGGTTAGAAAAAACGATAGTGTAGTTCACAGTATTTGTCCCTTCTGGTCTTGGGATTGGGGTTCTGATACTCTGCCCAGCCT 

CCAGATATAGGTGGATAGACCGAGATTCTTCCCTTGATGCAGTTTCTATTCCArCCAAGATATCTTCACCAAATAAGTCCTTATGCAAGGACACCAGAAT 

ATTGAACTCGTGAACGCTTGACTTGTTGATGATGCATAATTCACCTTGTTCGACTGTTATACATACTCCTTCAATAGAATCGTATAGCCCGGGGTTATTG 

GCACGGTCCTCTTCAGGTAAGAAAACTGCAAATATTGTATTAGGCACAACATCTGGCACGTTTACAGAATGACAGTAATGTTTATCTATAATGTCACCAG 

AAAAGTGAAAATCTATCATCCGACACl^AGAAAGGTTGAAGGAAAATCTTTCCGCATATTCTTGAGTTCCCGCAATAACTTCTTGTTTCTGGAG 

CTTTTTAGGATTATTTGTTGCCAGACTTACAGCATCGAAAATACTTTCTCCCATTCCTTCAGCGGGATTTTCCATCTTACTTTTGTAGTGGTTTTCAGTT 

CATTAATTGTGTTTCAGGGAGGCTATAAAACAAAACTCTTACGCATCTGTTCATTCAGTAGTGGACCAGACGTCGAGATGACGAGACATGGTGTGCTTGT 

TCCAAAAGGCCGTTCTAGGCATGTTATCCTAGGAAATGTCGACTACACTTTCTGTACTACTGACAATAATTGTGTCAGTCTAGACATCGATTTCAAGGAC 

AATATCACAGACCAAAACATTCAGTTATTGAACAAGAAATTGGGTAAGAAAACAGCAAAGAAAATAAAGAAGGAAGATGCACCTGAAACAAAGGAAAATA 

GTGACGAAGACATATACGCCACCAAGGAATTCGAACAGACAATAAAAGGTCTACAGACAAAAAAAGGTGCCACCGAGGAAAACGCCATCGCGGCCGCAGC 

TGCCGCTGCCACTGCTGCTGCGGTAGAAAAGGCTATGCTATCAGAAAGTGAAGGA 

GACACGTCCCAGAAACAGTTCACTGCATTGAAGAACAGGGAATCTTTCTTCAGTGTTTTGATATTTGAAACTGGATCAGTGATAGTTGTCGGGCTTCAAG 

ATCCTTCGCTTACAAAATTGTGTGTGATTAAAGCCACGACTGATATTGCTGATATTCTACAGAAAAACATCAGTGTGGCTAACGTGTCTATAGTGAATAC 

AGTGTCCACTTTTAATAGATTCCACTTGAACTTTATTCGACTCGGGAAATTCTTCGAAAGAAATTGCATCTCTTACAGTTATAACCCAGAAACGTTCCCC 

GGTATGTTTTTCAAGCTGCGAGTGCCCGCAAAGCCTCTCTTGCCTGGAGAGACrATAGGGGAATACTACACAAAGGTTGCAATGATGCGCGATAGTAAGG 

ATCCC^TTTTAAAATGTCTGACTGGTTGAGGATAAAAACTGCATTAACATTTAAAGTTGGGAAAATTACTGTGCTCGGAGAAGGAGAGAGTGGG 

TGATGTTTCTGTCGTATCCAAATTACTATTTGGTTTATTCCATTACTTTATGGACAACAACATTAAAATGTCCCCCAAAGAAGCACAAAGAGTCAGAGAA 

AAATACGGCATCCCGCATCTAGAATGGTACTTGTACATTGACATGTTGCTCCACTCCTACCCGTACGTCAAACCATCGGCCGAGCAAGTGAAAAGGGCGA 

TGGTGGACCAACAACATATTTCTGAAGTGGATAGGACATACTATGGAACAAAGAACAGTATGGACGCTGCCATGTCTGCAAATTTAGTGCCTTCAAAAGA 

AGAGAGTArCTCCTTCATTAAAAAAATAAGATCACAACAACTTTTTGGACATTTGTGTAAACCTTCAAAAGAAACTACTCGACGTGCTATAGACACACTT 

TCTTTCGATCCTATAAACCAAGACAGGTGGTGGAATAAAAATGACCAATATTACGGTAAAGAGAGATGTGACCCGTTTTCTGTTGCACGTTTAGTGTCTG 

TTTCTGAAAATACAAACAGTATGATGAATAGTCGCATTTCTTGTCAAGGGAAArGGTGGCTAGATGAAAACGAATACAAGGATAAACTTGATCATATTGT 

GGATTTGTGTACAGAAGAAATAGTGGAGGAATGTGAATCAAAGGGTTTTATTGCCTCCCCATTTTTGAGGAAGCACCAGAAGGAAAAAATACCAACGCCT 

TATGTTTTATTAGCGAGAGCCTGTAATCAAAAAAATGGTAACAAAATG^ 

ATGCAAAACTACAGGAAAAACACCGTGTAACTTTAGCCAGGTTGAACACGATGATGGCGTCGTACCGATTTTTGAATAATTACATCTCAACAGACATTGC 
ACCTGATTTTGCCAAGTTATTTGGTAATGATGTATATAGTTTATTACATTTAATGACAAACTTGCCTAAATCCCGTGGACATGCTCTAACATACAACGAA 
AGGGCCCTTTCAAGTAATGAAAGTACATATAAAACACCTGGAAATGCATACTTTAGTACTCTATTTGAAAAATCCATTATAAATAACCAAGAAACTGCTA 
ATAAAGGTAACAATAGAAAACGTAAATTTTCTCGAATCGGACAAGAAAAGAGCTCTTTTCTGTGCAACGCGTGTGGTGTCAATTTGAACAAGGGTAGTGA 
TGAAATCATAAAGGGTATTTGTACAAGTTGCGATCAAAATAGTACCAGTTACATAGAGA^ 

TTTAAAGCAGCTGCAACCCATCCGCCAGTGAAGCAAGAATTGGTAGATTCTTTATCCTCCTCTTCATCTCCTTCTTCTTCTTCTTCTCAGACGTCTAACA 
AGAACAATAGATGCACCCCTAGTGATTTTATAGATTATGTGTACAAATTCACTGACGAAACAACAGGTGCTCCAAAGGTGGGCTTAGTGTTTAAAATGTG 
TGATATTCTTGCATCCTTAGCAAGCAGGAGAGGGATGGAAGATCGTCCCACAGCCAACTATAGAACCTCCTTACATTCAGCTACTCAAAATAAAACCAAT 
TTGAATAAACTATTAGTTTCTGCTATCAAGGAAACAGGAGCCACTGAAACTGAAGCACAGATATTCAACAAGATTATTGGTAGTGAAAAGGGACTATCAA 
TTCTCTGTCAACTTGTGGAAAGGAGGAACAAAGACAATAATGTCTTCGACTGATTrGTCTAAGAATGCCTTCCATGACTGGGTGGTATCAAAGACAGATT 
GTGAGGTGTTTGATGTACACTGTGAGACGGACAGAGATTGTGGCGCTGCTTGCGAGAACACGTACTCTGTTGACGGAAAGGAGGTTACAAAATTCTCTTG 
TAACCAACAGTCGGGAAGATGTGCCAGGAGTGTTTATAGTGCGTCTTCTCTAGAAAGAGCAGCCAATGATCTTGGCCACATTATAGGTATCATCAAGAAA 
AATCCAAAATTGGAGGAAGAACTCCCTGAATCATTTTTGTGGTTTATCAATCACAATGGAGGAGATTTGTTTGTGAATAAGCGAGCCGCGTACTACGACA 
CGATGCATCTAAGCATAGGGAAACTGGATAATGTGGACACTCTTGCCCAGGGTTTAGATAAACGGATGGCCTCATCATTGAGAGAGCACCTACTGAGGAA 
GTTGGACTCTATACTTTTACAAATTGATAAAGTTAAATATGAAAAGGCAAAGAAATGGATATTGGATATAACACAGGAGGCTGGCACCGAAGAGGACAAT 
AAAGAAGAAGAAGATGCGAAAAAAGAGGATCAATCTCTTAGCGTTAGTGAAATTGTGGATGTTTTAACGGGCACACATGACCCTATGCCCCTGAGGGCTA 
GAGGGTTTATCCAGAAAAAAATATATCCTTTGTCAAGAAACGAATTGAGAGAATTAGCCCTTAAGGAACTTTTCCCTGAAGAAACTACATCTCCTCAGGT 
TTTGAGTAGGCAACATGATGTATCTACGCGTGAAGATTTATGCAATGAAAGTATGAATGCAGGGAGGGCAGAATCCATTTTTAGCGACCCTGATTCTGGA 
GAGTACGTGGCTACTTGTGCATGTCTTTACTCGGAATATTTAACAGGGCCTGCGTGTAAG 

AAAGGACTGGAAGACCTGAATTTCTAACTGATCCTGTACTTCATTTTAAAAAGGCAGAAGCTGTGTGTAAATCGACAAATCCAAACTTGAGGGCAATTTA 
TAGTCCAGATAATAAAGGTTTCTTGTGTGCGCCTGTAGCTGAACTTGTAAAGACGGCATTAACTTTTAGGGGTTCACACGAACCGTCTCTCATTGTCGAG 
AGAGATATAAATCAAGCTGAAAATCTACCATCCAATTCATTTGGTGTAAACTGGCCCTATGTGAATCTCCTAAATCGCATTCAAGACCAGTACACGTAAT 
TTGATAAAAAAAGATCGGGGAAAATGTCTGCATCTTTAATATTGGACGAATACCTCAAGAAGACTGCTTCAGCCGTTCTGGATGTAGCTGACTCATTTGA 
GAAAATCAAGGGAGAAATCCAATCACCTGAGGAGGCTGCGGCTCTTTCTGTTGCTCTCTATC 

ATCATCACTGGAGAAAGAACATCTTTAAACGACAAATATCTATCGGATAATGTCCTATTGAAAATGTCTGTTGCTCGCGTTGGACAAGAAAATAATCGCA 

AGAGAGCCGACCAGGCAGCTGATGAAATTAGAACCATCATGGAAGATATTACAGGGAGTTTGTCCGGTGCGTACAGGCAATATAGCCCGCTCGAGGAAGA 

AAATAAGGTGCATATAGGCATCATGAATAACAAAACGCCTAGCATTGTTTGTGGATATTATACAATGGACACATCTATTTCTTCCGA^ 

ACAGATTTTCAAAACCCCACTGTCATTGCCAATGTGACTAAGCGGATGGAGAGCATTTTTTCAAAGGTCGACTCTGCTAGGTCTACAAGATTCGACGCTT 

TTGTTAATGGTGTTGCGAATAATATGGATATAAAGTCATCAATAGATTGGGCAAATATGGTAGAAAATGTGATCAAATTACCAGATTCTACACCTAACCC 

TTGTTCAGTTGACACTATTGTGTCCAGAGACGCAAGTGTAGTTAAAACAGCAGTTAATGATATATACGCTTCTGTTGGAAAATCTTATTGTCGTCCTGCA 

ACACAGCTAACCTTTATGAGCGAGATTGAAAAACTGCGAAAGGCTGCAGTTGTATGTTTTGAGGCACTCATGTCCGATACTAGGGAGAGGGCATTCGTAG 

AGTTCCTATTTTACGTTAGCTTTAAGGAAGATGCATCAAATACCAATTCAAAATTGTTTGTTCAGAATAAGCTATCTTCCATGTCTGGAAACCCCAGACA 

GCCCATAAAATTGGTACGCCGTTCTGCTGAGGAAACACTATTCGGGCTCTGTTTCATGTTTAAGGTAATGCCTCCAGAATTCATGAACTGTATATTTAAC 

TTCCCTACCATTCCCCATTCAACACAATACCATGGTCTATATGGTACATGTTTAACCCCTCTACTTAGAAAATACGGTTCTTCATTCGAAAAGTCCTGGG 

CTCATTTTGAGGAAATTTTAAGCGAAAGAGCCAATGCAGTGAAAAAATTTGGTGTAAACGATACGAGGATAGATTGTCTAGATGCAGTAGCAAATCTCAC 

CGGACCTGTGTATGTTCTCATTTTAGATCTTGTACGTACTCTAAGTGCGCAGAGATCGTGTTCAACTAAATTTCTCCGTGAAATTAAGGAAAACTATCTT

TTGTGGAATAGGTTTGTGTCATAATAAAAATGGCTCAGACATCAAAGATGGGAACTAACAAGAGGTGTTTTGAGGAGGAAGTGGAGGAAGAAAGGCAACA 

ACCTTTCACAAAGAAATCTAAATCGGAACCACCCAGTTTTGAAGACAAGAGTTCATCCACATCTTCTAAGAAGAAGAGCAAATCCAATAAACACACCAAG 

ACCAAGGAAGAACAACTTCTAGAATTCGTGAAGGATCTGGAGCGGAGCGACCCCACTGTTCCTGATGAGAAGGTCAAGCAAGAAGTTGAAGAAAAGTCCC 

CTGAAGCTATTGCTGAAATTTTTTCAATGTTTGGGATCGCTCAAGACAGCAAGTTCAAGAGCCTTCTTCCCATTGAACGCATAAAGAGCATCACTACTAA 

AATTCTTATCGATGCAATTAATCAGCCTGTGCGCAAGATGTTGGTTGACCACCTCTATCATTTTAAGGAGATGCAGAATGTTGTGGAGAAATATAAGGAC 

GATAGCGACGAAAAACTGAGCGTCATTCTTAAGAGTAAGAAATCCCCCAAAGAATTTGACCTCTCCTTTTCCGATTACGTTGATCGCCTTAACAGGATTC 

TGGTTGGTGTAATTAAGAGGGTGGCCGGAGCTATTGAAAGTAAGGAATTGTTGCAGAGTAACAGCATGATCATGAACAGTGTTCTGGGTACTGTTGTGTC 

CAACATTCCTTACAACATGAAGATTAATATTTGTGTGTTTTTGACTAACTTTATTTGTACATTTGCTAATGATGATTTGTACACATTCTTTAGGGATGAT 

GAGAAATTTGTAATGAGTCAGGTAACAAGATACATTTCAAAGGATTAGAAAAATAAAAGATGGTATAAAATTACTTGTATTTTTATTCAAACAAAACCTT 
TATATTTGTACAATTCCCACTTTACTTCTTCTTGATTTCGTCCTTGATATCGATCACATTCTTGGAGGTGAATCCAATGTTCATTGTAGCTTCGAAGGCT
GCGCCGTCTTCAAATTTGCTGCACACGTCAATGAGGGTAATTTTTGAAGATGAAGATTTAATGTCCTTGCTCGAGAAGTAAGTGTTCGACATAGTATTTC 
CCTTAATGTTTCCTGCAATAACAGTTGGCCGAACAGTAATGTCGTTGATCAAGAGATCGACGCCAGTGTTGTTGAGGATCTTCAAAGTCATGTTAGAGTT 
CTCGGCCCCAGTGCCAGCGGGATCAGTCCTTGGGGCTGTGACGGTAGAGATGACAAGATCAGCAGAAACATCCTTCATGTCCTTATCGGAGAGACCATTC 
TTCATGGCCACCTTTCCAAGAGGAGTATTGTAGGACCTCTCTCCACGAATTGACATTACCTTTGCCCTTCTTTGAATTGGGACTCGCATCArCTGATCAT 
AATTAGCGACGACGCTTCTTCCAACACGTGTGTTGAATACAATCATTATAACCATGATAACGATTAGAGCAATGATTGCAATGGACAAGATrGCAATAAT 
TGCAACGTCCAGGTTTGTT;iGGTTGCCAAATTCCATTTTTCTTTGTTTTAGATGGAAGTTCTTCCTTTACGTGTTGGTTGGATCCAGCTAGrTAGTTCTT 
CTGTAGCACCATATACCCAGAAAGGGTTTGTTTTTAAAATTTAACCCTACATCGAACCGGATGAAGTGTACCTTGCCGTGATCTAATTTCAGTAACGATA 
CCTCCCACTACCCCTACTTTCATCTCCATCTCTACCCCCTCCCTACTCCTCCCTTCACCTCCTCCTCTCCCCCTTTACTCGTATTGGTCGTCTTCATCGT 
CGTCTTCTGAATTGGATGAAGAATTGTAATATTTTGTGTGCTTCATTCTTTTCACTTGCTTTGCTAGAGAATTGACGCTAGATTTGTTATTCAAAAGGTC 
CATAAAAGTCCTCCGTCTAGAAGGTGTGGAAGATTCTTGGTCAATATACACACTGCTAGCGCTACTCATGTTGGAATTGATGTCATTTGAAAAAGAAAAC 
GTTCTTTTCCTACCAACATGAGAAGATACACrGCTAGGCACGTCCATTGCTGTTGCTGCTGCTGGGGTGGAGGGAGAAACATCTTGATTTTTAGAAGGAC 
GGTACATTAAGTTTCGCATAGGCACAACAGArGTAGATTTTCCGTGCTTACCCTTTTTATTGTATTTCTCAGATGCTTGATTTAGCTTATATTTTATGCA 
GTACACTTCAATGGACAATTTAGCTTTCTTTAAAAGTTTGATCACTACTGCGCTTAGCAAGTGTTTCCTCTTGTCCATGAAATTGACTAGTGTATTTAAT 
GCATTATCAAAAATAGGGGATCTGTGTTCGATGATTGATGAGCCTGCACTGTATGCTACACCAAACCTTCCCCCGCCTTGTGTCTGGAGATCAAACATAC 
TATTCTTGCTGTTATTGTTACTCACTCCAGAATACGCGTCACCGAAAAAGTGGTTTAATGATGTACGGTCATTACTGTCCCCGTATCTTATATTATCGTA 
TACTCGACGAATTGCTTCCTTCTTGTCCTTCTCTGAAAACATGTTGCCGTCTTCACTTCTTTTCTCTGCCTCACTTCCATTACCATCTTCATCCTCTTCA 
TCGACTGCCTTATCAAATTGTATACTGTTTAAGACGTAAATGGGAGGGACAGCCATTTTAGAAGAACAACTAGAAATACTATCCAAAAGAAGAACTTTAT 
AATCTTTTGACAAAGTGAACTCTTTTGGCCAArTTAATAGTACAATTGGTATGTTCAAGAAAATTGTTAACCTTCCCACGTCCATACATAGTTTGTTGAT 
GTGTTTCTCTATGATTGTAGAAATTGCCCCGAAACAAATTTCTGCCCCTATAGAACCTAACCCCATTTCTTCCATTATGTTTGCAGATTCTTTACTCATC
TGTTTTAGTTTAGATGAGGTAGACAAATCTGATACATCATTTGTTTCAGTTTGATGTTTTTCAAAATCCTCTTGGGCTTTGTTGGTGTCTCCGAACAGTG 
CCACTGATGCCTTGGTCACTATGTCCGAAAGCATATTGACTAATCTAGACCAAGAAACAGCACTTGATGTTTTAAAGGATATGAGGTTGTTTAGGAACGC 
CAGTCTTTGACCAAGCTTATACGTTTGACATGGTTGATTATCGTCGTCT^CTTTCAGTATATTTATCAGT/TCCATGTGCAAATTTGCTTGAGTTTGATTA 
CTATAAAATGAAGAATGCTCCACCACCTCTCCTCCTCCTCCTCCTCCTCCTCCTCCTCCTCCAGCAATAGACTCGTCGCCTTCAGCACATACTCCACCAG 
CAGATGAAGACATGGAAGTTTCATCCTCGTTTAATTCAGATATACTTTCAACCTTTTTAAGAGAAGGAGCA^ 

TTCCCCCATTATATAAGTTGACGCGTATGCTGCTGAAAAAAGAGGGTTGGACATGATTTGCTTCCTCTGAAACAACTCCAATTTCTCATTTGTTTTCCTG 
AACAGCTCGGCAGATTTAATGTCTGTCACCGCAGCAGAATCTAAAAATGCCTTCATGGTTAATTCTGCAGCGATGATATTTTTAAGGCGCTCCATTACTT 
TCTCGGATGGTTTTACTGGGTTTTCTGGATCTTGGAACACAACTTTCACCAGTTCAGAGCATTGAAGGTTCTTTGTTGTTTCTAGAAGAGTTTTTACCCT 
TGTTTTTAAGATTATAGACTCCATAGTCTTGTGTAATCTTTCTTCGGCCTCTTTCCCACCTAATGTGTTGTTAACCTTTCCAACCACTGTACTCAAAAAG 
GATGGATTGTTGTTGTTGTTTGAAAATAAGGACTGATTTGTAGGTGGAGGTGGAGGGGGAGGAAGAAGAATATTGGTAGGTTTGAGTTGAGGAGGTTGAT 
GAGGTGGTGGTGGAGGTGGAGGAGGAGGAGGAGGCATCAGCATCGAATCT^GGTCCATGATGTCTCTACTTTCTTCTCCCATTTCATTGTCTGGAGGAAA 
AGTCACGATAGATGGAACAGGCGGTGGTGCTATATCGTTTAGTTGGTCATATTGTTCCTCACCTCCACTACCAAGAATAGAAAACATTCTAGGTGCTTTT 
GGTGAAGGT^CCTTTATCTGGGGGAGAATATXTTGTCTCTTTCATGATTGGAGGGAACACAACTGCATCCTCGCACTCTTGTATATCTGGAGGGGGAGATC 
TTCGAACCCTGGGCGCATTGTTAAATTGATGCATAGTACTTATTTCTTCATCCTCGAGTGTGTC 

TTCTTCAATATATTCCGTGTGAGATTGAACAGACTCTTTAGTTTCTACATGCTCTTCAGATTTTTGTGCAGGTTCCTCCTCCTCCTCTTCCTCTCCCTCT
TCTTCATCCTCTTCTTCATCCTCTTCATTATCATTGCCGCCACCTGCTATTGAAGAGAAGAAATAATCCATTCCATCATCATTACCTCCTTCACTCATAG 
GGCTGGATTCAACTCTATGAGAGTGCACGCTTTCATTTTCTATTGCATCTTCATCCTTCTCGGGTTC^ 

CTCTTCCTCCTCCTGTTCTTCTTCCTCTTCCTCTTCCTCCTCCTGTTCTTCCTCTTCTTCATCCTCTTCCTCTTCTACGGGTTCTATTTGTTGGTTATCA 
TCTTCTTTGTTCTTCTGTTCTTCACCTGCTGATCCGCCCATGGCGTTCACATTTTCAAGTAAAAAGTTCAACATGGGAAGAACTTGCCTATTTGGCCCTA 
CTGGCGAGGGGGTGTTCACTATACTTTTCC^CTGGrTTTCAGGTATAATTGTCGCATCTTCTTGTTGTGCC^CTAGCGCCAATAACGCAGGGTCAGCCTC 
ATCTTCTTCCCCTTCACCTTCATTGTCATCTTCATATTCATCTCCTTCTTCTTCCTCGTCTTCTAGATTTCCGTCATATTCACCCCCTTCTTCTCCCTCT 
TCATCTTCCTCTTCCTCCTCTTCCTCTTCTTCTTCATCATCATCATCGTACATGTCATTAGTGACATCTCTCACAAATTTACTTTGTTTTTTAGAAGATG 
GAGGATCTTTATTTCTTCCATTTCTAACTCCCGTGTCCTCATCTTCATCTTCGTAATCCTCCTCCTCCTCCTCCTCCTCATCCTCGTTCATTTCCTCCTC 
ATCCTCTTCGTCTTCATACAAGTTACTATAATCAACATTATTATTGATATCGTCCTCGTTGTCGATATCCAACACTCCCGCACCTTCAGTTATACGCAGA 
TTTGATGCCGCTTCCATGATGAAATTTTAGTATACAATATTGTGTTAACCGTGGAAAATCAAACACTACATTTACTAAGTTCCTTTCCTGAAAAAACGAG 
TTACCTCTATAGGGTAATTCTCAGGGTGTATATTCTATAGTTTGGTCTTGAACGTGCTTTTGGTTTTTGTTATAACTTTCTCTAAAATCTACTAACGGCA 
AGAGCATTGACAGTTGAGTGGGAAGGGGTGGTTCAGAGTAAGACCAAAAATAAAATATTTTTTGATTGTGTGAATTTCGAACCGTTrATATTACTATTTT 
AGGGTACTAGAAAACGTTGTCAGAACCAATCATGAGGTTTATCCCGATGTTCATAATTGC 

AATTCAAATTACCTGGATCACAATATCAATAAGGAACTAAATTTGACCCGTTCCCTTCAACTTCGGGGAACATTCACACCAGAAGATATAGCTCATAACA 
ACAGAATTCTCCCTTCCAAGCTGAGTGTTTTAGAAAGAGGGTCAATCATTTTAGCAGAAATGGACAAGTACAAAAATGCACAACCTACAGTAAATAATTC 
TCAAGAGAGGAGGAACATTTCTACTCCTCAACAACAACAAC^GACCACCCCCTCATCCCAATCATCTTC^ 

GCTGATTTATGTTCCTGTATGTGACGAGTCTCGCACCGACGAGAAAGAAGATTTTAACAACGATGAGGAAGAAAATATATTAAAGGAGGAAGAAGTTAGT 

GGAGGAGAAAAGGTTGTAAATATCGTAGTAGGCAGTATTAAGATGATGAAACCTGGTTCAAAAATCAAAACAAAGTACAGGCTCTACTCCCTCCTTGCAA 

GTGCGATAGGGAGTAGAAAGAAAGCTGAAGAATATATTGAACGTCTATACAATTCCTTTTCTTCCATGACGTTGAATGAAGGAGGACGTCTGGCTTCAGC 

GATTTATTGTCCATCCTACAACAATAAAAGGATAAAAAACAATCGTTCTAGGCCTGTGAAGCTGATCCATGCCTCCAGGGAATTGTTATCTGAAACTACG 

GTGAGGGAGGAAATACTACGGAAATCTCCCGCTTCTTCTTCTTCTTCTTCAACGTCATCATCCACTTCTTCGTTTTCTTCAATTTTTTTGTTTGTACCTT 

CTAATTGTACTTCTAAAACTGTATGCGATTTTGTAAAACACATACAGTATGAAGAAGATATTAACCGTTTAAGGTATAACATAATACATATATCTGAAGA 

AAACTATGCTTCAAGATTTTCTAAAATTAATAGATTAACTACATGTATACAAGGTATAAGTAAGACAATAAAGATTCTATAAAAATCAACATTTATTTGT 

ATCATTTATAAAATTCCATATTTTTGTATCCATTTCGTAGTCGACGGGTAGAGATGGTCTGTAGAAAGAAGATGCTGCTCCTTTTAAAAGGTTATTGGAC 

ATGGCTATCACATACATTTTCTTATTGTTTTCCCTATTGATGATATTTGTCATGACGTTGGGAGTGGCATTTTTTGTTGCTGGTGTATGACMACGGCAA 

CATTACTGTTGTTCTTAACCACCTGAGCAAACATGGCTGTAGGGGATTATCTTTCTATGTCCTCTGTAGGGGAAGCAACGCTCGTGGGTTTTATGATTTT 

AAATTTTATAAACTTTGTCACAATATTATCTTTAATAATATATGCCGTGACAGACGTTTATAGAAGATGCAAGCGGCCGTCTACCAATGGATATTCTGGT 

TGTACAACCAATGTGGTTTCTTCGACTTTACAAGAAGCTAATCTCGTTACCACTGAAAAAGATAAACCTGTTCAATTTGTTAGAGGTTTAGTCCCCAGAA 

AAATGATGGAAAAATATAGATCGGACTTGTCTCCTAAGAACGTGGGGGAATATATTTTACCTTCAGAAAAAGAAACAG 

AAAGGGTAAGAAGGTTGGTCTTTTAACTGCCCTGAGTAATGGTCATGACAGTAACAAGAGGATTATAGGGCC^ 

AAGG AC AAAAGTTATGTCTTT AAG AG ATTG AGC^UU^G AT CCGCTCGTCT ACT ACTCTTC TGC AACC T CT AAAT AC GTT AG AAAAT TTTCCCCTTTC A 

CAAAAAAATTCATGACATCAACACAGTTGGGGAGTAAGCTCGTGTATCCTCACCCTATACGGTATGGTACTGCTTTTGTACTACCCACGGGATACGTGAT 

CAACAAAGCATACGGAATGGATAATGAGGATCTACACACTTGGAACCCACCCTCTTCTTCTGTGCTCGTCCCAGACTCTAATAATGATAGATTAACAGTA 

GAATGTGCTAAAACGGACCCAACACATAGAATCCGCATCTACGGCTTTGGAGGGTCTGATGATAATAGACGTGCAAAAGAAGAAGGTTATGTAGAAATGT 

TGTTATGTAATTGTGACAACCACAAGGACTTGCTTAAGGCTCCTCTAATTACAGAGTATTCTACAAATCCAACTGAAATTCAAGTAGATGTTGCTGCAA^ 

ACGTGTTTTATTCCCTGCCCCTGGTTCCGAGCCTGTAAAATCTTCCCAAGTGACATCTGCTGCTCATCAACTAGACGGAGCTACTGGCGAGCACGATATT 

TCCCATGAGCCCGTGAAGCTATCAGATACGGGTGACTATGCAGTTGGATCACCCATTGTATTCAAGCCAGTTTATGGTACATCTTTAGTAAATCTTCCAG

AAACAG^ATCTCCTCTGGCATTGAACTGCCCCTGCACCGACAAGGCTGATGGAATATATCAAGTCAATCAAAAGGGAGGGATATTATATAGAGATATGGT 

GGGGTATCTTAACGCCAACCCTGTGGAAGCTGCATCACTTTCTTCCTCGGACTCTTCTTCGTGGTTGACAACTGGTAACAAAATATCTTCTGTTACATGT 

GAAGGAGAAAAAATAAAGAAAATTGTGTAATCATATGTGTAGTTTTATTACCACTAAATTTATTGGTCTTCTTGGTACTGTTTACACTAGGGAATTTTAC 

TCTTCCTCCATCTCAAAAACTTTTCAAAAATTTTTCTGGGTCGCTCGAGTTTAGAGGGTGGACCGCTGGGTCGGTCTGATGTCAAGTTTGGAGGGTCGAC 

CGCTGGGTCGAGCCAATGTCAGATTGCACCAGAAATGACAGAGCTCCAGAAACGTACATTAACTTTATACTGTTTCTGGAACATGTATTTCTGGAGACTG 

ACACGTCCAGAAAGGGTAGGGCATAGTCTACTGTTTTCACTAGGGAATTTTACTCTGCCTCCACCTTAAAAACTTTTCAAAAATTTTTCTGGGTCGCTCG
AGTTTAGAGGGTGGACCGCTGGGTCGGTCTAATGTCAAGTTCGGAGGGTGGACCTCTGGGTCGAGCCAATGTCAGATTGCACCAGAAATGACAGAGCTCC
AGAAACGTACATTAACTCTATACTGTTTCTGGAACATGTATTTCTGGGCACGGACGCACCCAGAAAGGTGAGGCTGrTAACACCCTTAATCCACTAGAGA 
ATTrTACTCTTCCTCCATCTCAAAAACTTTTCAAAAATTTTTCTGGGTC^ 

GGTGGACCGCTGGGTCGAGCCAATGTCAGATTACACCAGAAACACCAGAGCTCCAGAAACGTACATTAACTTTTACGTGTTTCTGGAACATGTGTTTCTG 
GAGACTGACGCACCCAGAAAGGGAAGGGTATAACCTACTATTTACACTAGGGAATTTTACTCTTCCTCCATCTCAAAAACTTTTCAAAAATTTTTCTGGG 
TCGCTCGAGTTTAGAGGGTGGACCGCTGGGTCGGTCTGATGTCAAGTTCGGAGGGTGGACCTCTGGGTCGACCCGATGTCAGATTACACCAGAAATAGCA 
GAGCTACAGAAACGTACATTAACTTTTACGTGTTTCTGGAATAGACGTTTCTGGTCATGGACGCACCCAGAAAGGGAAGGGTGTAACCTACTGTTTTCGC 
TAGAGAATTTTACTCTTCCTCCATCTCAAAAACTTTTCAAAAATTTTTCTGGGTCGCTCGAGTTTAGGGGGTGGACCGCTCACTCGGTCTAATGTCAAGT 
TCGGAGGGTGGACCGCTGGGTCGAGCCAATGTCAGATTACACCAGAAACGTTTGrTGCTCCAGAAACGATAACACACATTTCTGGAGCAGTAGTTAGTTA 
TTTCTGGAGCACGTTAAAATAAGTAGGGAGAATAATACATAACATTATATTTTCAATTCCATTTTATTGGTTCTATCCATCTAAAGGTGTTAGAGGAAAT 
ACTTTCATTGTAGTATAAGAATCCTTACTGAAATTTTTCAAATCATCCAAAGAGAATACAATCTTTCCAGTATAACTTAGAGGGTTCCTTGGTGrAGCTG 
TTTTCGTGTCCCTGGCGAGACGGTACTTATTATTGCTTCCCATCTTTAAATCTTCGCTTCTACTCCTCTTCATAGACCTGAATAAGGCTATGGATGAAAT 
AGGGGTCTTTCCTACAGGTTCGACAATAAGTCTTCCTATAGATGCGTTCCGTATCAAGTCTTGCTTGGCCTTCTTCACATTCCTGTCGGATGCGGCTTTA 
CTCCCACTAGTTTTCCATATCTCTGATTCCGTTTGCATCTCTCTAAATG 

TCTTCAACATGTTATCCACTAACATCTTGGATGAAAATGCAGATAAGAGAGGTAAGGGCCTCCCTGGTACAAAACTAGTGTTTGACGCATCAATCACATT 
TCTTCGTGCTCTAACAGCATTCCTTAAAGCCCAAAAGCTTGCAAAGGATAATAATGATGATGTATTTTCATCTGTTGGAACATTTCCCCATAAGTTATGT 
GTAGGAAACTTCAAATCTCCCCATCTTGCCAATAGTAAATATACTGCTCTGTCGAAATTGTTCGTTATTGAAGGATTGGAGGTGGCTGCAGCCGCGTACA 
ACAGGTAGAAAGAAAGGGCAGCAAACCCTTTAGATACATGGACATATTTTGCAGCTTCAAACACATACCCTTTCTTATCCACTAGGTGCATGTTACGCAC 
TTCTTCGTCCTCTCTTTTTTGTTTCATTTTTCCTTCAAAACCTATATCACCACCTAACCCTCCCCCTCTTAATAGACCCGAATCTGCAGAGAGTATCTCA 
ACCATCAACTCCCCTAACCCATTGACATTTTCTGCTCCTTTATTATTACCACCACCACCACCTAAAGCATCGAGTACTGCTTTCCAGTACGTCTCTAGTT 
CCTGTTCTCCCATAGACGGTTTACTTTGTTCCTTTAAACTGATGGTAAAAGTGTCTAGAGATTTTGTTATGAAACCAGATTCACAACGCTGGCAGACGAA 
TAAAGAATGCATATCTTTCAAAAACACTTGAAGAATTAACCTGCTTTTCACTGCAGAGCTATTAGTTTTCTGTGTTAGCCCTCCAGAGCATAAAAATAAT 
GCATTCCATAGCACATACATTGAAAATCGGGCAGCTGAGTCGGGATCAAGAACGCGGGTAACAGAGTAGTTGGAGAACATAACTTGTCTATTATTGACAA 
GAAAATACTCCAAAGrTTCTGGTGAAACAATATCGTTAAACTTGTTACATCTCTTGTATTCACGTTGAAGAGCTTGAAGGGCGGTAGGTGTGCGTTTAGG 
CCCTAAAATTCCGTCTATCGCCTTTGGAAGATATGGGTGGTCCCTAAAGAGATTCTCAGCAGTCATATCCGTAGTTTTAATTCGTTCTCTTTCCCTCTCG 
ACCCATTCTTTTTCCCTTTCTGTGACACTTTCGGATGGCATCTGCAGATGGGGATTGTTCACATATTTATCCAGTrTAGTTGTCAAAGGATTTGGTTTAG 
CGTTTCTGGTCCTGGACAGCACTGCCCTGTACTCTGCGTCGATGGCTGCTGATAATAGTACCATGATTTCCCTCCCTAAAAGAGCTTCATGGAAAGAACC 
TACCTTGGGGGGGTAGAATCCTTCCTGTATTACAGAAACCATGTTGGTTAAAACAGTCGATATAGTTGGAACAACATAAGATTGTCCATCTTCTTCACTG 
ACTGTAACTCCATCACTAAAGGACCCTTCAATAAAGTCTTTTATCCTTTT1TCAATCATGTCAGCAGCGCGTCCGTACTCTAGGTGAGGAGGCATGGCAG 
CAAAGGGATCTAAACCCCTCTCTTTCATTCTAGAGGCGATAGATTGCTTCCATCTTAGAATATCACAAAGAGTATATCCTAGAAATAGAGCAATGGCTCT 
TTTTGCTCGTCCCATTGCCAGATCGGCCACCGTTGAATTGTTTCTGTTGGCCAACATTTCTGAATCAATTACCCATGAAGGTTCATTAGTATCACCTCrA 
GTGGAATAAAGAACTAGATTAGCTGCTGATCTAAACACACCGTCCATAATATTCTCAAAGGCATTCAAAAGGTTCAATAATGGTGCTTGTTTCAAAAAAG
TAAAGTCACATAGTTCTTCCGCTGCAGTTTCGGTCAATCTCCAAGAGGGGTCAATTATATGCTCTCTAACACCTTCTATATCTCTGTAGACACTCCTCAT 
GACGGCATCTTTAGATAGGTGTCCTTTGGCTAATCTTAGTCCCGAATTAAAAAATTCTGTCGTTTTATTGAACCACGTGGGTCCGAATACGTCAGAAGAA 
ATAACAACCTTCCTTTGTTTTTCAGGTGTATTTTGTAAATTTGAAATGGCAGCAACTAGCCCTTCCGATACATAATTTAGTCGTTTTTCATCGGTTTTAG
TTTCTGCTACTAATGCCAGAACATAGTCCTTTGACTCTGTCATGATTTTTGTGGTGCTGGTAAAGTAGTTTTTGTACACGCTGCAATATCTTCGACCTTA 
AATGTCTTCGGTTAAAGGGATATTTATTTGTTGGTACTGTTTTCGTCAAGGCTGAAAAATAATTTCAAATATACAATGGAGTACATGGAAGAAGGAGACA 
TCGCTGAAAGGCGCTCAGAAGGTGTCGACTATATTCTGGACGAAAACTCTGCT^GTGTAGTTAATGTGAAGAGTATCCGTAACAGGCTCGGTGCCATGGA 
CGCCGAGGAGGCACAGTACGCACAGGACATTTCCGCCCAACTTGTCACCCATATTATCCGTCTGGCCCACTGCTCCGAATCCAACAAGATTAAGGATACC 
ATTGCCAGTATTGCGGGTCTTTTCATCAACAACATCTTTGACAACAATTCAACAAAGAACAAACTTAAAACGTATAATCAATTCAAGGCAGAGTCACAAA 
ACAAGTCTAGCGTTCTCAATATCTTTGGCTCTCTAGATCCTCTGAGTATGCTTTCTAGCTTCATGGGTTCTGATCCAGCAAAGAGTGGAGGGGAAAATTT 
GGACAAATCTTTGGGTGTGCTCTTTGAGGTGCTTCAAAATTACAACCCTTGCAAGATTGACGATATTGTCCTTCTGGAAATGTGCCCATCCAAGTGCGCC 
GCCTGCACCGGTCTCAAGGAAGCCATCCGCCAGGAACAACCCATGGAAGCAATGTTGTTGTTTTTCAAATGTATCAACCATAATAGGTTCAATTTTGGAA 
GCGACATAAAGTCAGCATACGCCTCTGAAACATGCATGAGATACTCTCAGGACGAACGCGCAGTCGTTGTGCCTCTGAGGAGTATCCTCCTCGGCTGCCT 
CGACAGGGATGATCCAGCTCATACTCTTTCCTCTTTCGGGGATACTATCGAGTATGCTGATTCAGATAACGCTTGGGTTTCTTCCCTGTTTGCAGCCGTC 
TCTAGAATGCCTATGGTAGACAGAGCCGTTATTGCTCACTTTTACGTGTACACAATGCTCAGCCGACATAGGCGAGTATCTGGAGACAGCTTCAAGCAGT 
TTGTCTATACCGTATTTGTTCGTATGATTTACTCTGCGATTGAAATTTTGTTCT 

GAGCTATGTTAATGCCATGGTTAACGTATCCGTGCTGGGTTCTACGTTTAACGTACTAAAAGCCTACCGTTCATGGGTGGTGGATCAAGCATCCGTCGCA 
CCGGTTCTAGACATTATTTCCGGAGGATGGAAGAAGAACTACCCCTCACCTGACCACATCAAGAGGGTGGCGTACGACATCTCTCAAGTCATCAATCATC 
TTGCATCACCTTCTAGAATGGTTAAAGGTAACAACAAGGCTAGCAACGTTACATCTGGCCTGGATAGTATCAGGTCTGTTCGTCAAGCAGAAAAATATAT 
CCCGTTTGGAATACTTGAAAATAAAGCAGGGTATGGTGTAATAAACATTGCCAAGCACAA 

TTTAATTGCAACGCTTTACACATTCTACCTTCAATTAAGGGCTGTGAGGCACTTGGGGCACAAAAGGGGAGCGCAGATCAAACTGTCAATGTTTTTGATA 
ATTTTGTCGCATCTCATATGGATATTGCCATGAAAAAGCAGGGGTCGGGGAAGATTCTTGGACTGCTCACTAGCATGATTGACAGGCAAGGTCTGACTAC 
TTCATTCCCTAGTAGTGAAGCGGAATACAAGAAGAGAATCCATGATTTCACAAGATACGTCATCTTCTCTTCAACACCCATCAACGACGAACTAGTCAAT 
TCTCGCTGTATTCTTCCCCATTCTAATGTTCTGAACTCCCCTATCAGCTTGAGAAATATTGACCCAGAATCAGTCCCCGACACTCGATTCCACTTTCTGC 
TCATGATGTGGCAGCGGCCAAATATCGATGAACCTAATCTTTCTGCTCTAACTACAAGTCAGCTAGAATTGTTGCTTAGCAAGAACCAAAAATGGGACAA 
ACTCACCACCAGAGCGTTCTTCAATATCGACAGGATCAATTTCCAGATGGCAGACGCTATCATTAAGAACGTTTCTGGAAGCGGCTTCCTAGATGGGAGT 
AAAACTGCCTCTTCTTCTTCCTCAGCGCCTAACTTTTTCCAAATCTTCAGTGGTGCTGAATGCACTGCAAAGCAGCTCCAAAGTATTCGCAAATTCATTG 
GAGAATCTATGCAGCATGTACAAAAGGAATGGAGTAGTGCAGTAAACAATGGGAACAGAGGAGTAGAAAATTATGACGGACTCAATGCTCAGTTCTCTGA 
AGAACTGTTCGAGCTGCTCTACAAATTGATC^TCGAGGAGGATATGCGGCCATCCAGC^ 

ATGGATGAACTTCTTATCAGAGCTAATGCTTCTTAGATAGGGTTTTTATGAAATAAAACAAATATAAATTATATATCATGCAATTTTATTTATACCrTTT 
TTTGTATGTGATACAATATTTTTAGTGTTATGACAGTGCAACCATTATATTATTATCTGTAATAGGTCTTGGTAAATTTTTAATGAAAGGAAGATTAGGT 
GCACAATCTATTGTCATAAAAATGTCCAGGTCAATTTCTTCAACACCTTCCTCAAAGTTATAGAAGAAACTTGTTTCATCATTTTTATTAGCAATAAACA 
AGTACGTTTTTTCCTTAGCTGTGCTTGCAGAGCTTATCCAATAGTCTGATGCCTTTTTAACATTATAACAGTTGGGGAGTAAACTACAAAAACAGTCCTC 
TGTTTTGTTTGTAGCGTATACCCGTACACGGTTACCTTCTACTACAAATGAAAACTCTATCCGGGCTTTTTCAATACGTCCTCCCACAGCTGCTCTACCT 
TGTTTTTGTAGCACGGGTTCTAATCCAACTTGTAGAGAATCCAGATTCGTTGCTGCACTAATAACATTAAATGTCCTCTGTTTCATGTTGACATGTACAG 
ACATTCGTCCTTGCCAAGTGCGCATATCACGATCACTGATAAAACCTCCTCGTATATTATTTATCACCAATTCCTTTGAGTACCGGGCAAGGTAATCAGG 
TGTAGGTTGAGGAGGTTGTTGTTGTCTAGCTAGTTGTGAAGAAGGTGTTGTTGTCAACATCACCATTTCTGCGTCTGGGTCTGTTGATCTCTGTAGGCTT 
TCATCCCCATTTTCATCAGAAGGATCAACTGTTACTGTACCAAACAATAAAATAT J ^ 

TTTGTGCAACAACTTCATCCATCGTTTCCTTGACTTCGTCATGGTTCGTCCCTATGAGATCGAACAAGTTTTCATCAGCAATTGCCTGTCCGACAT 

GACGTTTATGTCACCTTCAGGGATTGAAACTTTTCCATCTGTTATTTTATCCTTTATTAACCATACGTTTTCTGATTTGTATGTGTCAAATTCCTGATCT 

GCTAATTTGGTGAAGAATTCATCCTCAGCTTTTTCAAGAGTTCTTGCAAAATGCTCTCTCGTTTGACGTTGAAGCGCTGTCTCATTGATAACATCAACTG 

TTCCTCCCCCTCCTCCTCCATTATTATTATTATCGTCGTCAGTTTTTTCATCATCGTCAGTGTATTTAGAGTCTAATTTATATTTCTTACTCTTTTTTGT 

TTTGGCATTTGAATGTAGAGTTCCCTTCCCTACGCCCGAGATAGCAAAGAAGAGGGCAATAGAAGCTCCTATTACCAAAACAATAACCGCAAAAACAATG 

TATATTCCTCCATTGTTTGAAAGTGCCATTTTTAAACCCTTCAATTCTTATCTAAAAAACAAACAGATTGAAATCTATGTATAAATCATCACTCGCGTTT 

TCGATTTCAGGCATGTGAATTGTAATGAGAGAGGCCGAATCCTCGGATGGGTTGATTATGCCATTTGTATCATGACTAATATATATAGGATCATTTCTCT 

TTAGATAATCACCTATGTTATCAGAAGAATCACTATTTCTTAGATTTGATAATATCACTACACTTTTTGGTCTGTTTACAATACCAGCGAACGTGGATTT 

GCGGTTAAATTCTAACGTGAGAGTTTTGGCTGAAGAATCTATTTTGAACGATAAACTTAAATCAATTAAATCCCCTTGAGCAGCTCCAGGGGGAGTGATT 
GATAACATGTGTTTATCGTTTATTGGTACACTAAATGAAGTTGAAGAGTCGGTAAAGGCAATAGTATTGTACTTGAAAGAAAATGATGTATGATAAAAAC
CAGGTATATATCTTCTCCACCGCTCCTCTTTTAAAATTTCAGTTATGAAATCTTCCCTGGGTTCCATGATCCGTCrATTCACATCGTATACTTTTTCCAT 
AACCACTGGTATTCCTTTCTCAAAAATTGCTTCACAGATAACAGGATTGAAAGCGTCTTCGTAAAGATGTTCAAAAGTCATTACATTAGTTAAATGGCTA 
TTGATTTCTTGCCTTACTTGATCCTCTATATATTTTArAGAGGCATTTTTTGCTACCGAATAGGCAATTGCACCTAAGCGATACTTTAATTGGTCACTCT 
TTCTTGTGATCCAAGAATTGTCATGGTGAGCAGCTCCCAATTGAGCCTCCACTGAAGCGATTTCAGCTTCAGTCACAGCAGGCTCATCTCCGTTAGACGA 
CATTATTTACTCCTCCTTAAAAGCAGTGATTGCAAACAATATATCGTCCCTTGTATCGCCTCTTTTTATACGTTTAACAGCGTATGTTGCAGATTCTCCG 
TTTAGTATATGGGTGTGTTTATATCTAGATTTACATACAATATCAATATCTTTACTCTTCCCTTCAACAATCACATTAAAGAATATGACTACGCGTTCTC 
CTGTTCGATTAAATGCTTCAAAGCTTAGCACGTCATTAACTACGTCACTACGAGACACAAGAATCACTGCATCTACATCCTCATCATTTATGGAACGAAA 
CAATTTTGCCGTTATTTTAAAATCTCCCAAGGGAATGGAGAAATCGTACCTTGTAAATT^^ 

CTACACCGCCCTACTTGACCAGTTTCTTGTTGAATTTTTTCGTCTATATCTGTGCAAGATATTGGGGGCCGATACCAATGTACAAGATTTTGCGTTAGTT 
TTCGTTTAGCCAAGTAATCAGTTATTATTTGTCTAACATCTATTCCTTGTTTTTCAATTGCCTTTAAAACACAATCCTTTTTTATGTCTTCTAGGCGAGA 
TCTCACATCCAGTTCTGCATCCTTCATGTAAACTCTGTCTCCATGTCTTACAGTGTTCTTCAACAATTCGTTTATGTGTTTGGTTATGGGGTGTATCAAC 
AATTCAGATAGATTAGAAACACCACGTCTTTCTACAATTTTATCAAGGATGAGGATCAACCTCTCATCGCTTATAGTTGCGCCATTAGACATGCTGTTGC 
TACTTGCTTTTGGTATACAACATCAACAGGTGTAAAAAACATACAAAAGAAATAATAAAAATAC^ 

TTTTATTGTTCTGGGGCGAAGATTATATACATTCAATCTTTTAACGTAGACCTTTCTTCGTCTTCTTCTTACTCCTACTTCTTTTTCTCCTCCTTTCTGG 
TCCTGGAACTGGAACTGGGGCTCGGTAACTGGAGAAGTTGTGTTGGTGGTAGTGACAGATATTACCTCCTCTATTTTTGGTATCCTGAGTACAATTTTTG 
GTTTTGGTTTTAATTGTACTGTAGAGGGAGTTTTCTGTGTTACCCTTATAGCCTGTTTCAGGAACAAGGGTAGGTGTCTATACCCAAGGGTAGCCATTGC 
TCAAGTGATGCTCCACCCCGTGCTTTTCCCCCTTATATACAATTTCTCACGTGGTCAATAAAGGAAACACAAAAAATATTCATCAATTCTTTTATTTGAT
CAAAGAACATGTTTTACACAGTTTGGGCACAGATAATCAGGGTTAAGTCTCAAACCCCATTGGTTCATTACAATGTACTTGACAATATCAGAATGGTCAG 
ATTTTTGAGAAGCATTAGGAAGAGCTAAATTGGTAGAGTGTCTAAGAGCATGCAACGGTTTATTCACAACCATTGCCAGAAACATGGGAACAGATAGTGA 
AGAAACTTGATTCTCATCCGCAATATCCTCTGTATTTGCCCTCTTGTGGGATGAAGAGGACAAAAACACAGGGTATTTCAGATTACTAACCATCAATTGG 
TACACCATACAGGCTAACATTACAGCAACATAATCAACACCAGTTTTATCACTCTTTAGTGTTTTTAATGAGGAAAGTCTCCTCTTACATTCAGTATACT 
GGAGAGCCATTTCAAACAACATGTGTAAATGGTGGCAAGAATACATCAAATCCCTAACATCCATTGAACCCCAGGCATCACACAAATGTTTTGTTAGTGA 
AGAACCAACTCCTATTAATTCCCCTTCCTTATCATATCCTACAACATCCATATCAGGATTTGGTAAACTCATACCAATACAATTGTCAAAATTGATGGCT 
GGCCTATTCTTTCTGGCAGGAGAGTGCATCTTGATGTTAAAATCTCTCCCAAAGAATCTTGTGGTCTTAACAAAGGGCATTTCTACAAATAAAACTGAAT 
TGAGCATGCTATTTAGTTGAGCCCATGCGCCTTGTCTAGAACGAGCAATATTCATAGCCATTGTATTCAAAGATTTAGGATTAATCTGGCACATTTGAAG 
AGGTGATGGTAATTTATTGCACATGGACTCGTAATTTGCTGGTTTCTTACGGCGCAATTGTTGTTCCTCAGCATCACAACTATTGTCTTCTTCTACTTCA 
GTGATAGTAGC^GAAGAAGAAGAAGATTCTCCGTAATTTGTTGCAGCCTTAGCACAAAGACTCATTGAAGTGTACAAATGGAAAAAGGTGTTAGTATTCT 
CCTCTAAACGGGAAACTGATTTTACGTCTCTCAATTCTAGGACCTTGTTTCTGATCAAGTTTTTCATTAAAACTGATGGAAGAACGGATGATACGTCACT 
AATTCTATCCTTCTTTTTGTTGTCACTGGAATGCCCACTAAAGTCATTCC^TGCTT^ 

TTTCCAAATTTGATACATCTTTCCATCATTAAAGATGAATGGATATACTTCATTCCTCTATCCAAAAGCCCGATAATTTTCTTAACATCCTCATTCTTAT 

TCTCAAAGACAAATGTTGTATCACAAGGAAAAACAATAAGTCTTTTAGACATAGCTTCGTCGACAAAGAATGAGAAGATGTAACTGTTGGACATCATAAT 

AGGAGACACAGTAATGCTGCTTTTTGCGCCCTGTTGGAAAATGCCTCTAATCATGACGCTGTTTTGGGGTGAAGTCATATTCTTTAATAAACTGGAGGGA 

AATTGGCCCGGAAGCATATGTTTGGTTATAGAACTGGACACATCATTCTTTTGGGGGATGGCCTTAGCGATTTCGGTAATAATGGAAGAAGATCCGTCAC

TGTTATGTTTTTGGAGTGCAGACACCAATTGCCTTCTTTTAGGCGTATCGACATTGAAACAAGCCTCTTTATTACTGTCTCCTCCACCACTAGAGTTAGA 

AGCAAAGGAAAATTGTGAACTTTGTTGCCTATTTTCTCCAAAAACACTTAAAAAGTTTGCAATATCACTGCTGTTATCGTCTGTTTTAGGTAAAGAA 

GTATTTTTTGATGAATTAGAG<3AAGAGTCGTCCTTTTCGTCAGAAAGCAAATTTTTATCCATACACATGCGAATAAAGGATCCAGAATTCTTAGCCGACA 

TCTTATCCTTCTCAATACGCTTCTTTAAGCAAGAAAGGACATTTTCGCAGTCAGAAGTGGAAGTTGACAGAACCACTGAACCATCAGAAGTAAATCCAGG 

AAAACTATATTTTGACGGTTTCTTAACCACATCGAATAAAGATGCGTCATAACGTTCGTACATGTTCATAAAGTCCAGATCTCTATCCACTAATAGTTTT 

GCAGCTTCTGGGTCCTTTATTATGGACGATAACTTGCGAGAAATAGTCTTGATGTTACGAATACTGTCAATATTAATATTCGACAGAGTAGGGCCCAGAA 

GAGCCTTCTCGGCAGCATCATCGACAGCTACAAAGTGTTGTTTCTTGGGAGGGGGAAGTAAAACTGGGT^ 

TCC^GTGTAAACGCAATGGTACGCAATCAATACACCAGGGGCGGATGTATTACCATCAATAACTGACTTTTTAACGCCCCCGTTAGTCGACAGGCAATTA 
TCCCCCAGGTATTTTTTAATTTCGAGCATGTCCTGGGCAGACAAAGATGACGTGATCATCTCGGAAAACCTTTCAGTGGCAGGGTTGCAGACTAAAACAG 
GCGAACCGTCGCGATCAAATGATGTATTCCATTTCAAAAGGCCATTGTTTTTCTGTATTATGTTCATAATCGCATCAACTTTGTCTTCAATTTTCATATT 
CTGCTTATTTTTCTTGTTCTTGCTCATAAATTTCTCATTCCATTCATCTATATCTTCTTTACAATCGTTCCAGGAATGAAGATTATTAACCTGGCAGTTT 
TCATCAATAATGGATTCAAATGATTCGTTATCGCATATTTTGGATCCAAATTTACCTAATATGCACTTCAAGGCAGTTTTTACTTCGGATTTAGAGAATT 
TGCCTTCATATTCGATAGGTTTCTTACAGTGCCTGTACACAGCATCGGTTTCGTTAAGCCAAGGAGAATCTGTGCAGTCTTTAATAGAGAAAGGCATGGC 
GCCTCTATGGAGCATATTATACACAATATTTTTCTCTTTAAAGGTAGTTGAAACAACACCAGAAGAACAATTCCACCAGTGAGCTTTTCTGGCAAAGAAA 
GAAATATTGATAGGATCCTTTATGCGGACACCATTTTCGTCATTTTTACCGTCGTTTCTCTCAATTTCCTCACATTCTACCCCTTCATCTTCAATTTCGC 
CACCTCTTTTACGTTTTTTTGTATCATTACAAGCATTTTTCCTATTCTTTTCGTGTTTGAGTTCAGAAATAAAATCGTACTCTTGGTCGCATACTTTTAG 
TTTTGATTCGGTCATTAAACGGTTAATATTACTAAAAACGATCTTAGAGTTTCCTCCGTCACCGTACGCGTTTCTTGCTTCATTGATGTCAACACTATTC 
ATTTCTCCCCCACTGCCGCCAAAATCGACATTACCACTTCTGGAAGCAGACTCGTCCTTATCGTCTTCGCTCGATGACTTCTTTGAGCCAAATGAACCTT 
GGTGGCCAACTTCGTCAATAATCACGATACCCTTTTTGCCAGCAAAGGAAGCCAAATTTGCTTGAGTAGATGAAGAAGAATCCCCAGCATTATTGAGAGC 
CGTAACGTTAATATTTGTAAACCTTTCGAACCAAGTCCTCTTACACAAATCGACAAGAGTGGTCTTTAAATTGTTTCCCTTTGTTCCCACAAATACAACA 
GCAGTTTTTGTGGCAATATTCAAGGCAACGGAACAAAGAGTTTGAACCATGTGGTGGGCTTTCTCTCCATCTCCTGAAGCAATGGCTGTGCACATACCTC 
CAAAGTTCATTAGGCCACGTAAAAGATACACGACATCCATTCTGGGCATGCCAGTTTTTTGGTCAACACATGAGATGTAACATTTTCTTAGCGCATGAAG 
GGCTGTAGTTGTAGAATTAACATCATCCTTTTGCTTCCTCTTCTTATTACTACTACTACTACTACTACTATGTTTTGAAGAAGAAGAGGAGGAGGAGGAG 
GAGGAGGAGGAGGAAGAAGCCGTTTCACAATCGCTAGTGGCTTTAGCAAGTTCGTATAAAATTGATAAAAATATATAATCTTCATCAGCAATAAAAATTG 
ATGGAGGTTTTAGAGAAGAACTGAATGCTGTTTTTGATACAGGTTCGTACTTTACAACGCTAGTATTTTCCGTGTTTTCTTCGTTCTCATCCTCATCTTC 
TTCATTTTCGTCATCATCATCCTCATTTTCATCGGTTTCTTCCTGATCTTCAGCCATAGACCAAAAGTTTTTGTTCATATTACTGACAGAGTCGGTATTT
TTCATATAAGATACGTGTTCGATTAAACAAAGTCCTCTACCCATTGTAGTGCCTTCCATGGACGAAGCTAGAGCAGCAGAAATAAATTCGGTTGAGAATA 
AAGAATTCAAAATAACGTGTTTTTCTGCCTTTAATTTACACCTTGTATACATGTCCGTGACCAAATTTATCCCTTCAAGGCGTCCTCCTAGAAGATAAGT 
ACCATCAATCAAAAAGTACCTGTTATTATCAACGCTAATACAGGCAGTGTTTTCAGCTAA 

GCATTTCCCATTTTTTCAACTTCAGCCATAATTTTAGAAATGAGAGCAGTTCCTGCATTTGTAATACAAAGACGTATATTTTTGGCTCTTTTGAGCATAT 
TCTTGGCAGGACACACTGGATTAGCTAAAAATGAGCATTTGTATGCTTGTTCCGAAAAAATATTCTGCTTGTTGTAGTCTGACAAAAAATCGGGACAGTA 
TCTATCGTGGAACAACTCGCCTTGTTCTGACTCATTGTCAAACATTGAAGGAGTTAATCGCTCGCTGAAACATCCCCCGTTTCTAATTTGGAGAGTAGAC 
TCGATCGCCTTCTTGTTATTCTTACCCTTTGATGATGAAACAATACAGGCCCA^ 

ATAAGAAATCGCAACACTTCTTAAAGATTTTTGTTCCTCTAGGAGCGACAAATTTACCCTGCACAGGGGAGTTAAAGAAGGAATTATCCAAGTAGAAACC 
CTTATCTGTTCCTGCTAAAAAGTTAAAATTATCCTTCTTAATGGCCAGACTAGATTCGTCCAGACTTTTCCCATTAATGACAGACCGCATAGAAATTAAC 
ATCCTCTTAGCATCCATTGAATCTTCTTGCAGAAGGCAAGTTTTATCTGTACCACCTTGATCGTCATCATCATCTTCATCGTCTGCGCAGCCAGTATCAA 
TGCCAGCATATAGACCAGAATGGTCCTTATTGTAATTGATGACACGAAGCACGCGCTTATGGCCATCAGAGTTGGTACTTAAAAACACGGCAGTTTTAAG 
GATACTGACAATTTCAATTTCTGATGAAACAAAGTTGGGACATTTCTTAGTTTCTTCTTTACCAATGAAACAGTAATCATTGTCTTGTTGCCGATAACAA 
GATTTGTTGAAAAAATCTCCTCCTCGTCTTATATTTGTCCCTGGTGTAAACAACACTGAACAACTATTAGTGATCCAAGACGTACAGAAATTACCCTCGG 
GATCTAAACATGAAGACAAAACTTCACGGGCGTAAACAGCCTTAGCCAAACTAGCTGCAGAAATTTTTTCGGGAGAAGGATCAGAGGAATTATTGTCTTT
AGATTTAGCAGAGAAAGTGCGGTGCAGATTCATAAGAGGGAAAGTGTCCTTCAGTAGTTGGCCATAGTCTTCCACACTCAGGAAACCAGAATGGTATACT 
GAAAGTGGAGGGTTTGAATCGGTAACGCCACATGGAGCATTATTCTTTGTCATTTCAAAGTGG^ 

TACACAAAACGCTAATGTGACGCTCGTACAATTTAATCCACAAAATGTTTAGATTTTTAAACAGCAATTTTTTAAGCAAAACTATCTGGTTCTTAGGGGA 
CATACCATTGTTACAGTTCCACATCTTACCCAAACTTTTATAGGTGTTGATATTGGATATTTTTTGGTGGGTAAAGTATTGCTGGGAGAAAGTCTGGTCA
AATGAAGATTGGTTAGAAGAGCCCAAGATAACGTACGCGTCTTTAGAATCGTTAACAATCTTAATGTTTTCTAATTTTTCAGTGGAAGTAGTTAGAGAAT

CATCATAAAGAGACAATAAATCTTCATCATAACCCATAGGAACATTCTTTAAAGAGTCTTTTTTCTCCAAATCTTGAATTGAAACATATTTCACCGATGA 

ATTACTACTGCTATTAGAGTTATTGTTAAAGTTCTTCCAGGGTCTCTTTCCAAACTTCTTTTTTGACTTGTCCTCATCTCCCAGGGCCATATTCTTAATG 

TAACAAATTGGCATAACTTGAGACCAGTACGAGTCAGAAATATGATTGGAGAAATAGCGTGTCAGGACCTTTGTTTCAGCAAAACCTTTCCCCTTGTACT 

GGATAAAAGTTCTCAAGGGACGGCCAACTATTCTACCCTCATGGCCTGTATTTGCCAAAACACTGCCCTTTCCTTGCTTCrTTTCCGAAGAATTGACAAC 

TTTCTTGTAGAAGTTAAACCAAAACGAGCAATCACTTCCTTCATGTTTCAATTGGTCTrCAGAAATTTCATTGGTCTGGTTAGCTTCATTATTATTATTA 

GAGAGAAGGTGACCTGATGAGGATGATCCACCCCCACACCCACCATTTCCTCCCTCGCCAACAGAAGACATGTAATCGCCAGGACATATGATGTTCATAT 

CATAAACGCTTCCAGrATCAGTCAAATCGCGGCACTCTTTTCTTATCTTGTTAGTTTCTTCCTCAACTGCCTTATTCCACCTTTCAATGGCATCCATTCC 

AGCACTTAACGCAGTTTCAGTGACACCGTACAAAGCATACTCTTTTTTCGAGAAAGGAACACAAATCTTAAAAAAGTTCTCAATGTCAGAAATTACTTTC 

ACAATGTC^TTATTTATCTGATTTTGTGTGATCATGTTCTTAATTCGGTTAAGTACATAGCCATCGTACGAAAACAAGCCACTTTCTCTGTCTTCTTCGT 

TTCT GGCGTCCATGTGT ATCTGGC TG AAAC CCGTGCGGG ATC TCG AGTCCTC Arc AACT ACGGATGATG ATCCCCGACT AG AATC ATC AT TGT AAAAACC 

ACCCGCGTTATTAAAACCTCCTCCGCCAGATTTCTTTGATCGTCCCCTATTACTTCCACCCTTTCCTGTCATTTGTTTTGCACCACTATTATCCTCCTTG 

TTAATCTCTTCTAGAGAAGGATCGATAAATGCTTCTTCGTTAGCGTCCATTACTTTAGCTTGACCGGCAGGAATGTTGTGGAAAGAAATGTTAGTGTTGA 

TGATTCTAGAGGACTTTTCTACAGCAGCGAGTTGGGCAGTTGTACAGGAATTACTAACCAGATTGTAGTTAGTATCGATTCTAGTCTCTTGCAGTTCTTG 

GCGAGTCTTGGCGCGGAACTCGTCCAAAGATTTAACACTGACATCATTAACGTCCAAAATACAACAATCCAACAAATCATTTGAAGAAGCACGC^ 

TCTGCCGCTTTTACTAAAACAAGCCTTGCAATACGTTTCATTACCTTCCCAACACGTTTTTCATACATATCATTGCTTTTTTCGCTATTTTTGTCGGCCA 

AAGAGGAAACACTAACCAAATTCTTAACTAGTGAAAGGGGCCTAGAAAGAAGCCTGCTTCGCCGTCCCATTAGGACGrGAGAAAATCTATAGGTGATAAA 

TTGCAACCAATTACTAGTATCAAATCCATTAAAATCAAACGACATGAGAATCTTTCGGGTCATTTCTTCAATGACAATATCATTACTTGCA^ 

ATAACATATGCGGCAAAACGGTAAAAGTGGAGAGGATCTTCTTCAATATCGTAAAATCTACCTTCTTTTATGCTAGCGTCGGCCTTTTTGGGAGAATCTC 

CCTTGCCTGTACAGCTGCACCACTTGCTAGTGATTTTAAAGCATCGAATGAGTGGTACCATTTTATCCACAGATAGACGCAAAGGAGCAGTTGCAATTTC 

TCGCACACTTTCTCCCTGCATCATGATACGATACATAGCGACCATTGCTAGCACACAAGAGAGGGACATTTCTGTCTTGAGAACCTCTTCTGTTGCAGCC 

ATCCACTCTGCCCGACTGCGCAAAGTAGACTTGTTTCGTGGATAATATTCGTCT^CAA 

TCTTCAATTTCTTAATTACGTTGTTATTTGTTAGCAAGTTGCTTGAATTTATGAAATTTTCAACAGATCTAATGTCGTTGTCTTTGAGAGCAAACTCATA
GGCCTTGGCGACCTGCGCTTTCC^GTCCTTATTGTCGTCATTCAAAAGACATTCCCACGTATCAGCATCCAAATTTGTGTCCAGATCTTTACCAAAAAAT 
GCCCTCGCGCTAACGCCCAGAAATTCTGATAGCTCAAATACGGCGCCAGATTCGTCCACATTTGACACACCTGCCTTTAAATTTAGCATGTCACTACCGT 
CCCAATCCTTAACTAAACTTCCCCAATCAATAGGAAGGCTGTTATTAAAGCACTCCCTAACAGTCTCTAGAATTGTCCACAGTCTAGAAGCTGGACTATT
GGTTCCAGAACCGTATCTTGTTATCAAATTTGAAAAGAACTCAGAATCATTACTTGTTTTTCCATTCAAAGCTGTACGCATGATACATTCTTCCATCATT
TTAAAGCACTCGTATATGTTAAGTTTTGAAACGTTATGGTCCTTGAAATAATTTGCTGCATC 

CAGAAAGCGATTGATTAAACACTTTAGTTGGTTTTAGAGGATCAACAACTGTAGAACCCAGCACAACAGACTCTAAATCTTGATTCGCACACAATGCCCA 
AATACCAGAAAAGAAACCATTATTAACACTCTTATCTGATACACCATCACCTCCCGATGAACATGCCAATCTGGCTAAACTTCTAGGTCTAAACGTGTTA 
ATGAGTTTGTTATCTTCACTGATCACATTTTCATAGATACCCAAAGGTAATGCTGAAGATAGACATTTTCCTGCCAATTTACACTGTTTTATGGCATCCC 
ACCTCTTATTATCTAAAATAGACCTAATATTACCATTAACTTTGTCCTTTTTCACAAATCCTAAAGCTGCAGTAACATCACTAAAGTATCTCCTTTCACA 
TGCCATTGTGTTGAAACGGAGAGCTACAGTCAGATCTTTCAGATGTTTAGAACTAGGAATTGGGAGTAGTTTTATGCACTTTTCGTTAGACATGGTATTG 
TTACTACTAAGGGGAGAATCGCTCTTGAAGTTATTAAGAGAAGCACAAGAACAAAAGGGAGGGATATTGTTGTTGCTGTTAGCGGTTCCGTCTTCACTAT 
TGACATTAGGCTTGACAATTTCTCGTACAGGCATAGTAAAAGGCACTACCCAATTCCTCATTCTCAAAGATCTTCTATTAATTTCTTTCAAAATATCACC 
AGATTCAGCTGCAGAATTGTCCAATGTTAGTACATCACACAAATTTTTCTCCATTTCCTCCACAATGACAGTTGAAACArACTTTTGGCGATGACATAAA 
CTGGATCTAAAGTTGAATGTTTTGATGAAATTAACATCCCTAAACACATCATCTTCTAC 

CATCACTAGAACCAAATTCGCCTAGAAAACCATAACAATCATCTTCATCATCACCACCATAATAATCACCTGTTGCACCTCCCTTGGCAGCTCCATAATT 
CTCCTCATCGTCACTCTCTTCATCACTTTCTTCATCGCTTTCGTTACTGGCCACATTGTTCAGGATCTGGCGAGCAGAAACGTCGTCGTTGTCACATTTT 
CTTAGATATTCTTGTTTTTGTTGATAGAGTTTCTCCCTCTTCTCCGACTTCTTGTCTGCCAAAACAGCTTCCTTATCAGACTTGACGCTGGCAAAAAGTC 
GTTGGAAAGTCTCCATGGGGGCGCATTCATACATTGGTCCAAAAGACTTGTCAGATTCATCATAATCAAACGTTTTAAATGTATCGTTGTTCAATTCGTA 
GACCATTGTGCTTTC^TCATACTTAATTTTGCACAATTTGCTGTTTGTCTTGTTCTTGATCCTCTCCATGGAATTTTCGTATGCTGCAACTCCCATTCTG 
CAGACAGATTGCTTTTCAAGCACACTTTTGGGTACATATATGCTAAGTTCTCGGTTAATTGCGTCGCACATGTGT^ 

ATTCAAAATCTTCACGAATTATTTGCTGATAGTACTCTTCTGGATGAGAAGTGATGTTGTACAGCGCACAAAACCGCTCAGCAAGAGTAACGACTGTCCA 

CGCGAATATGGTGTCGTTCTTGTCAGCACCCGTGGCGAATGTTGACGTGATGATGGACAAGAGCTTCTTGATATTTTCCAAAGAAGCTATCTGGTTTTCA 

TCAGATGACGAAGTGGATGTTGCACCTTCTTCTAGACTAGAAACCATATCTGGTGTTGTTGATGATGTTGATGCAGCAGCAGCAGCAGCAGCTTCTTCTT 

CACTTTGAGTAGGTGGAGGAGGAATACTCTCCAATTCTTCAATTGTAGGAGGTTTTAGTGATTCTTTGTGGGTAAATTGCCACATTTCTGCAAATTTGCC 

CTTTGATAGTGCATCCCATGTGTTATTCTTGAGTCTTTGTTGTAGAATGTCCGTATCTCTGGGCAATTTCAGGTTTCTGTTCTCATATCTCACAACAATT 

TGAAGGATGGCGAAAGGCATGATTGTGTCAAACTTGATGGTGTTAACGCCAGTCAAATTCCTGATAATGTTCTTTGTTTCTGGTTTCACGCGTTCAACAC 

AATTCTTCTTGCTCATGGCTTTCTGGGGAAGAAGAAACTCCATTGCCAGAAGAGGAGA^ 

GAAACACACAATCTGOTCAAGCrTATCATCGTC^TTGATGATGCTATTATAGACATra 

TTGATGTTGTTCTCGTCTCCAGAACCGCCGCCAGATTTCCCTCCAGATCCTCCTTCCTCATTATCTTCTTCCTCGTCGTCTCCATCTCCAGAATCGTCCA 
ATCCTTTACACGCCATCTTCTCCTTTAATAGGGTAGCATTTGTCTTGGTCAATAACAAGCCACCTCCTTCTGCCAGAAACTTGTAACACATGCCAATTGT 
ACAGAAATCTTGATCCATAATCAAACTTTTGTTTCTGTAAGGCATTTCTGGCCCAAGTTCAGAAAAAAGGATCTTTCTTCTCCAATCGTCGGTATTGGCC 
ATAGAACCCATGAGGTTTTCGAGATATAGACACTTTTTCTCGAAACTTTCCAGTGTTTTGCACATCCACTCTTCGGCTCTATCTGCCTTCACGACACCCA 
GAGAAGTCACCATGTTTTCGTTGCTGATGGAGAGAATGTTCTTCTTTTCTCCCACCAACTGCGCATGGTGATAGGGGAACGCAGTGTCGCTCACTTTACA 
ATGTCCGGGAAGATTCCAGTTATCACTCCCGCO\ATATCAATCTTGAAACAGGCAACAGTTTTCTTGATCATACTTTCAACACGTTCGACGATTCTGGGC 
ATACATTCACTCAGTAATTCAAGTTGTACCTTTAAGATTGACTTTAACAACACCACATTTTCTGTGTCTGAAACAGCTGCAAGACGTTCCTCTTCAAGGA 
TGGTTGGGTAGAATATTACCTTGAGCCCCATCAAAGAGGCATTGGTGATGAAGGCAACCTTAAACACTGGCACGTCTGTTGCTGTTCTTCCACAACCCAG 
AAACTCTCTGAAGGCAGGAGTCACACATAGTTTCTCTGCACTAGAAGTTTCTTCCTCATAAAGCCTCATCATAATAATAAGTTCCCTGAGAAGCTTTCCA 
GTTCCAACGTGGGGTCCTTCTTCATCCATGCGCCTATTTTCATTCTGGCAAGAACGCTCATTCAGCTTGTCCAGAAATTGATCACCAGTAATTTCTGGAA 
CCTTTTTTACTTTGTACATGCCTTTCTTCTTGGATTTTGCAACATCCTTTAAAGTTGCAGGTTGACTGTACATTATCAATTTTCCATCGTTTCCTCTGTT 
CACTGTGTCATATTCAACAACATTACATCCAGCATACAGAAGATTGTTGGGGTTGACACTGCCGTCTCCGCCGACTCTATCTTGTACAGCTAGATGCCCC 
ATTTGTTTGACCAGAGGGTTGCCATTCTTGGTCAAGTTATCAGTGACTAGAATTTGTTCCCCTTCGTTGAGAAGGAGAAGAGGTAGACGTCTGTTGTCGT 
ACTCGGGCAGTTTGTTGGAGAGACAGATGCGTAGAGTCGTTGGTTCCTCTTTCATCTGGTTGGCACATTTTATTAAGTGATCTCGCACATCACTTTCAAG 
AAACTTGCCTCCTCCTAGGTTGAGCACATTTGCCTTAAACATATTGACGAGTGCAGTGTGCTGGTTATAAGTTTTGTTTTTTCTTGGTTAGTTATTTTTC 
TGAATGTAGGATAAAATAACGTAACTTTTCTTGTGTCGTTGTTCACCAAGGAATCTTTTGATATCAATTACAGCTACAACTGGTAAGGGGGTTGTCCGAC 
GACCGTCTGTATCCTACGGTATCCAGTGTCGGACTGTGAGGCAGAGAGCTTGTGAGGGCGTATTTATACAAGAATAGCAGCTGGTGCCAACTAGTGATTT 
TATAGAACAGAAATCATGGACCAGTACCCAGAAGTGAGGGATACTCCTCAGACAGAACAGGAACAGGCGGCGGCGCAACAACAACAAGCAGCAACAACAA
CAGCTGCCGCTGCTGCTGCCGCTGCTCCTACGCAGTACAGTAACACTGTTTCTGCAGAAACTTTATCCGCCATTTCTGAAGATGGAAAATTGGAGAGGTC 
AATCGCAGCTTCGTGCTGGATCAATAACCTTAACCCTGATGAAAAAATGGCTCAACGTGTCCAATTCCATCCACTTAGTTCCACGACCACATACGATTCA 
GAAAATGTGAACCCTGGTAGTTCTGTTGTGTTTTTGAAGCCTAGAGCCCTCCCCACCGGGGGCACGTGTTTAGCCCCCAACTACATTGCAGTGCCTACTC 
TTCGTGCTGCGTCAGAAATTATCGATTCTATTGCATCAACTAGTCTATACCAATGTTCAATGTTCAATTCGTGGAATCTTATTCCCATTTTTATGAGTAA 
TAGCAAACATAGTCAATTCGGTGATCGTGTCATCAAGAGATCAATGATCCGTAATTCTTTTTCCAAACAAAAAAATGTAGAAAATCTTTTAAAGGAATTG 
CGCAGGAGAAAGGTTAACGCCGCCAAGGCATTTTCTCATGCAGTTCAACAGAAATCAGCAGTAAATACTGCCCTTGCAGCATGGAACGCGGGAAGTGCTG 
CAAACCTCGAAAAATTAGTAGATTTTTGCAAATTGAAGTACAGTCCCGATAGAAAATACAAAGCAGGAGGTCTTTTTAGTGCGTCAGCAACAGCTCAGTC 
ACAATCAGGTACTTCTTCTTCCTCAGTTGAACATACTTCTAATGATTTCTTACTCGATATCCTAAAACGTCACAAGGGAACGTCCCTTGATTTAGATTCA 
GCAACAAACACATTTGATACTGCTCTTTCTAGGGTTTTTACCGAGTTTAAGGAACAGGCTAGAGCAGCCGTAGATGCAGCCGCAGATTCAGATCACCTAT 
CCGCGTCGGACCCTATTTTCTCTATTGTTCGACACAATAGCAGGCGTGAGGGTATTCTAGATTCAGTGCCAAATATCGGTATGTTGGCTCCACGATCAAA

ATATTCTGTCGCAGAGTACTTGATGGCAGACAGAGACGAGTCTGCAGATATCGCTGCTAAGATAGGGACAAAGATTGCAACAGATTTTGAAGCATTGAGA 

GGAGATAATAATAAGAGAAGAGCAGATACTTCAGTCGATGATCTCAAAGAATCACTAGCGGATTCTATTGAAAAAACTTCCATTAAAAATACCGGCGATA 

TTAATTCGGTCACAAATATTCCCACAGATACAGAGGAGTATGAATTTTCTCTCCACATCACACAGCTTTTCGCGCAGGCATTTTTGGAAACTATGGGAAG 

TTTATTGAGCTGTGCTTTCGGTGTACAGTTCCCCTrCTCAGATGAAGGTTTTGCGGCTATCGAAAGGATTATACGTAAAACAGATCCTGATACTGGAAAG 

GTTTCAGAAATGGACCCTTCCTCTCTTTCAGATCAATATCTCTTGTTGGTAGGAAATTTCCAGGTTTCTCCCTTCCACGTTTCAGATCCTAAGGATATCG 

TTTTCGGTAGACAAGTTACGCCTAATACGCCTATTTTGAGTATCATTACTAGATCTAAAAATGATAAGAATGAAACGTCTACTATTATTAATTTCAGGGA 

TAGGCTTCTTGTTAACGATACCGTGTTGAGAGACGCCACGCAAAACGTCTCAACTTCCACCCCTTCTCAAAGAAGAGTCCCCACCGCCGCAGGTGAACCA 

AAAAAGCCAATGCTCTCTGGTTGCCTACCTATCATCAGGGGACCCCAGGTAGTGACCAGGGAAAGTGATGATATGATTAGTGGTCTTGTGGGTGATTGGT 

ATATCTCTTTGGGGGTCTATTATGCCATGGGTTCATCGGCTGCAGCTATTGCCGCTGGCCACCAGAGAGCTCTCGCTTCAGCCGAGTCAATTAATTCACC 

CATGATGAAAAAATTCTCAAAGAAGGGAGGAAAGTATACAGAGGAGGAAAAGCGAATCAAGAAGGCTATGCGTAGGAATGCAGATCGTTCTGCTAGGATT 

TTGGCTTTGTTGGGGCAAACTGATGCCCAGTATGGGrATGTCGAACATAATTCTACCCTAGATTCGTTCTGGTCTTCGAATGCCGCTATTCG^TGCAAAGG 

CAAAGGAGGATGCTCTTAGCCGTGCAGAAATCTTGGCAGTTAGGAAACAJ\TTGGACGGAAAATGTTCCTCTTCAAGGGATGAATATTCC 

ATATCTTAGAGACTCATTTTTCAGGTCAGTTAATAGGTCAGGAGGAGGATATGAAArGTTTGATCAAGGTTTTGATATGGGAAGATTTGCCGACTTrTTG 

AGTGACAACTCTGCTGCCAGGAACGCTTGGCAACAGTACGCAGAAGTAATGAGAGGACTTTCTAAGCATGAGAAACGCGTATTCAACATTGAAGGTCTTT 

TCAGTGCTCTAAATTCTTTCAAGTTCCCCCTTGTTCCAGAACAGGGGCGTAAAAAGACTGTAGGTGGAAGGCATAGGCTTAACAATTTGAAGGCGGCCAA 

TAAGATCATTAACGGCATCACAGAGATGACTCTCCAGTCAGCCATCGATGGTACTGGAATCTCTGATATTATTGGTTCAGTTAGCGATGGGTGGGGAAAT 

ACTACAGCTCAGCCGTCTCGCGTCAAAGCTCTTAAAACATTGTCCAATTTCAGCGGAAACGGAAATGTTGTGTCAATTCCAGTTAGTCGAGCAGTAAAGT 

GTGCCGCGGGAAGTCGAGGTGGGGAAACTCTCAAGTGTGTGGACATTCCTTCAGTCATTATTGCAAACCTAATCTCTGATAAGAGAATTCTAGATCAACT 

TTGCGGAGGAGGGATGAACCTCGCTCACGAAATCACAAACTTTATCGAGACGATTGCAGGTAAAGAACATACAGGAAAGGAATCCGTTTTCTTGTCTCCT 

AGATTGTCTGTCATTCTTTTGCGTTATATTTGGTTCAACGCAGCGGTTGTTTCCCTTACGGATAGCAACATTAAAATGCCCCTCAACACAATGTCCGAGG 

GCACTGGCGATGACATrTATAGGGACTATTTGGCCATCAGAGGCATGGTTAATAATTACAACAGTAGTCTCTCCTCTATTTCAGTCAAGGCTATTTCCGA 

TAGGTACAATTGCGGTAGTGGAAATACTTCAACCAGTAATAAGAATGTGACCATTAAGACTCAAGGTGAATTGTTAACTGTCCTTCAACAGACTGCTAAT 

GCCTTGTCCGCCTTTACCAACAAGGGAGGCGTGGGTGCAACCCCCGATGCTGCCAATATGGCCAACGTTATTTCCCCAATTGCTAATGCGGATGTAGTTA 

AGAACACCAACGTGGTTGTTTCAGGGCTAGATAGGATCACTGAGACCATCAACTTCTTTTCATTTTTGTCTCAGATCAAAACAATGAACGAGAACATTGA 

AGAGTATCTTAGGAGATATAGGCTAGGAGAAGGACTAGATAAGAAAGAATTGGATAATTTTGTGTATCCAAATATTGCAGCTATTGTTAAGCGAGAATTG 

GGGGTAAGTGGTTCCGCATTGTCCAGTAATCTCGATACTGATCGTCCAATTACTATCGACCTGAACACTGAACAGCCTTTGATCGTAAAGGCTA 

GTTATGCCTCTAACCGCTACGCTAAATTATTCAACAAAACAACAAGAACAGCAGCAGAACAAGCTCAGATGGAGCAGTATAATGCACAAATGGCTGCCAA 

TACTATTCCTCAATTAGTAAACAGGTTGACCATCCCTGGATCCATCACGGCAGACACTGCCATCAATGTCGTTAAAGCTTTCACAGAAAATGGAGAATTT 

AGTAACGCAGAAACACACTTGGGGGTTATGGGTAACGCGATTAATGAAATGCAACCTCTTTTCACGGACGGATTCAACGTTGCGAACAAGCGTTTAACAG

TTAACGTGGGTTCAGTTAGTAAGCTGATTCAGAATGGGTTAACCGTATCTCTCATTCTTGCTCACTCAAAGGCTAGCCCCTATGTCTTTAAGCCTCTCGT 

GCAAGATTTCGCTAAGCTTTTACTGGCAGTCACTGCAGAGACTTCTCTGGTTGTCTCTAGGTCCCAGAAGAGTTTCTTCCCCATTCCTCCTTCAGTATTT 

TCTTCAGGTGGTCTTTTCAAAATTGATAGGGAAATGTTCGATAATATGAAGACAGATTATGTAGTGGAAGTAATTAGACAGCTATCTAAGAATGCTACCG 

CCGCCATCGAAAGGTGCAATGATTCCGATTCAGCTGCTAGGATTGCCAAGTCAGGTGAAATTTATAACAAGGATGTTGCATCAACCACTGCAGCTCCCGG 

AACTTCTTCCTCCGCTTTAACCTTGTTCGCCAATAATCTCCAGAACCCTGCAAAGGTATGGTCTATGGGAGCTCTCCCCCATTTCGATATGGCCGTGGTA 

CCAAAACTTCATGGCATTTCTCACGATCAAATGTTCCGCCTATCTACATATTATCAGGGTATTCATAAGATGGAACTTAACAGCGATTGCAAACCAGAAG 

AATGGGATAATTCTCTTCCTGGAAATAGGGCTAGCAAATTCTTTGGCCTTTCTTCGGTGAGCGATAACAACCGTTCATTCAATTTGGCATTGGATACTCT 

TTTGGCTTCACCTGCAGAGATTTGCGATCTGGTGACGAGGGAAATGGTAAAGACCAGTAACGATATTGTGCATAATATTGGATCGAATTCCAACACGGAC 

GCGCTTCAAAAGAGCCTTCAAGTTGGTGCCTCAGCAGTAGAAAAATACGACGAGTCTACTCTTTCTACTAAAGAAACTGACGTATATTCCCTTGTTTCTG 

CTCTGGCTAAGAGCAAATCTCCTCTATCTTCTTCCTCATCTTTGTCGTCTGAGGGACATCTCACCTCTAAGGAGATrTGATAGGACATGGAACACCCCCGC 

TCTTCTCGGTACCGCTAAAACTACATCCTATTCTGTTTCTGAAGACGCTCTCAATGCTCCTCTTTCAGCCGTGTTGGACTTTAGAAGGAATGTTGTGGAT 

GCTACTAAATCTCTGTACGAAGTTGCAGCTGTT/TGTAGTGTCATGAGTAAAGAG<3AGGATGTGCGTTCTTCGAGTAGAAAGATTATGGGAATGGTGGAAC 

AAGAATCGCCCGTTATGCAAGACATTGGCATTGACCGCATTGCTAGTCTTGTTAGTACAGTTGCTACCCCCAAACAGCATCGCAGATTCTTACAGACAGT 

AAACGATTACAAAAATTATCTCATTAGAAAAGTTGCATCGAATCCCCTTCTCTCTTCAAGATTGGGAGGAATATCCCCTACTAGTGGTAACACCGATTAC 

AACCTTAAAGCTGTATATGATGGTGTTGTTTCTTCCTCATCATCAATGACCCCCTCGTCCATGTCTGTCTCTGACAGATTCTGGTCGGGAGTATTTTCTC 

AGTGCCTAGAGACTGGCCCTTCAATGTTTGCCGATGCTGGTCATGGAGGTAGTAACATGTTCCAAATCACTGCACCTAAACTTTACGGTTCTAGAGTCAA 

CACCTACGCAGCTCTGAGCTCTGGCGTTGAGCGGCTGAGAGACTCTATTTCTTCTGCGACTCAGGAAAGAAAGAATAGGATTGCAAAGAGCATCGAAGCT

CTGGAAACGTTCGTAACCGATGTGGTGGGGGGAGATACTTTGGATCAATTGCGTAAGGCCCAGAACATGTACAACAAACTGTCAGATATTACTTCCAACT 

CTATCTATAGTGATTTCGGAAACATTGACTGCGCTAAAATCATGAAGAATGTGACGAGCAAGAAAATGACCGCTAGACAACAATCAGATACTATTCTTAG 

CTCTCTCTTGCACGAACTCGCTGGCCTGGTACACAAACAACAACCTCAATTGGCTACTCAATTTGCTCTGGCGAGCCATGTTATCAAGGCAAAGTATGTC 

ACTAATGACCTCAATAATATCCACGAGAAGGAAACATTCAGTCAATTGATGGCCGTGGCCGGTGTTGCCGATTACTATAATGTGTCGGCAGCTGCCATGT 

GTCAGCGTCTAGTTGCTTCCGACGTAACAATGTTCCTCGGCGGAACCATGCTCCAACAAGGCCTGTTCGTTTCATTCCTTCTTAACAACGTACTTTTCTC 

CCAGGTTTCTGATAATATTAAAATGAACGAATTGAACGATGAAACAAAGTCTCTTTTGGTTAAACTGGTAGGATTTTGCGGTACAGTTTCAGA 

GGATCTAGGCACGTGTCTTCAATTAGACGTGTACAGAACGAAGAGGATAAGAAATTAGACAGGAGTTTTGTTACATCACTTTATTCAGCATACAGAGATT 

TGAGGAAGAAGACTGAACTATACAGGGAAACTGATACTATTAACAAACTTTTCGGACATCAAAACTTTATGTCTTACGAATCTTCCATGCTCAAGAGGAC 

TTCCTTGGTACATGACGCTGTTTCCGGCCCTAGGCCAAGAAGGTACAGCACCCTTGAGGATGTACTTGAGGCTCCTTCCACGGTTCACAAATCGTTCATG

GTTTCTTACCCAGAGAGGGCAGCTGCTTCTAGGCGAGTGAAGAGGGCTGGACTCAGGGCTCTGGCTGATAACAGGATGGAATCTCTTTACGGGGAAGAAG 

TCTTGAACGATATGAGGTCTTCGGCGGTCTCTTCCGAAATGATGGATATAGAGTATGGTGAGGGAGGATTCATGATGATGATTAGTGATGATGAGGATGA 

TATTGCCTTTATTGATTCCGAAGAAGAGTCTGAATCATCTACTGATTTCTCCTCATCAGATGAATATTCCGATTCATCCGATGAGTATGATTTTGATGAT 

GATAATAATGGCCAGTCTCCTTATTCAACTACATCTTATTCGTATGATGCTCTAGACCGTCTGAATTCTGCCGCTAAGCCTCTTACTGCCATCTACGGGT 

GCAGGGGAGAAGGTGAAGACGATGAGGAAAATGACCTCTATGAAGAAGAACAAGAAAGGAGGAGACGTCGCTCATCAAAGATGGGGAAGATCCTTAGAGA 

TCTTCATGAGAGTGATGATGACGACGATGACTACTTTGATGACGAATTTGATGGCGAACGTTCAATGTCAGAAACTATTGCAACCAGAAGAGCTGGCCGT 

ATTCAATATGGTCCAGGTTTCCTATCTCATTCTAATATTCTTAACCGTCCGGCTAAAGCACGCGCTTTCTTGACACGAGGCAAGAAATTCAGGCCTTCTG 

CGTACGATAGATTCTTTATGGAGGATGACGATTCCCTCCTCTTCTCTGACGAATCTACCACTTCTTCTTCCTCTTCCGATTCTCCATTCTCCTCCTTCAG 

CAAGGGGAGAAAATGCAAGCGCCGAACAAGCGAGGACCAATGTGCCTTTGTTAAGAGAGTTGTACGTGCTTTTGTGCCCACCAGAGTAACAATGATCAAT 

GGTCGAGTTAGCATGATCACCCCAGTGACTAGTGAAAATACAGTAGGATTCTATGAAAATTACCAGAAGGCCAACAAGAGGGAAAGGGCGCGTCTGATCG 

AAGAATACAAAATTGTTAAGGGTGCTTCAGCTACCTTGCCCGACGAATACGTAGAGGGTAGAGCATCTAAACAAGTGTCTCCCAGGGAACTGAGGAGGTC 

TCTTATCAAGGCAGCTGCTTATGTTGCCCGCACCCAAGAAAGTAACTTGAATATTATCTTTGACGCTCTCACCACAACATCAAACGCCACTCTAGTTAAT 

GACCCATCTACTCTTTTGGGTGATACACTTTTGTTCGCCAAACAACTAGAGGCAATTACCGAGAGGAGGAATAGGCTAATGAAAGACCTAACTGAAATCT

CTCCTTCACTTTTCACATCATTCGGTGATGCAAGTAAAGACACCCAAATGATGGCCGATGCCAAACAGATCGTTTCAGGAGGAAATTTCAAGTCTGCCGG 

CTATCTAGGTGTCCCTCTCAGAACTCTTGCTTCATGTATTAAGGGCACTAATACAGTTGATCGTCTTTTGGCTACCAAAAATAAGAACCATCTCGAATGG 

ATGACCACAGCCGCTATTGTTTTTGCACGTTCATTCAACGATACTACTTTCCATGCACTCGAAGATACACTAAAAATGACCTCCGCTTTGACAGACATGT 

ACAGCGCTTTCACCAACCTCGTCGGATCGGAACATTCTCAGCGTCTAAAAGTAAAGAGTACTCTTTTAGATTCTATTTTCAACACTAGGATGGCTCACAC 

TGAAGCAGTCATGGGTCTCGTATACCCTACAGCGTTCATCAACCATGAAATGCCCTCTGATTACACACAGCGCAGAGAGATGCAATCACTCGCTCTTAAC 

ATTCTTAGGGGAGTTAATTGTAGCCAATTGCCACGAAAGGATATTGGAGACACTGCTGGCCTGTTGACCTTTATTACATCACGTAAATTTGCAGGTTATG 

GAGGAGAAAGGGGAGGTTTGTCTCTATACAGAATGTCCATTGTTGATGCTCTTTCTTGCCCCTCTGACAATCGGCTCAAGGGAGCAGTCTCTCTAGAGGT 

AGGAAAGTGGCAGGATATGGGAGAGGAAATCTTCTACAAGAGGAGCAACGATCTGGTCGATTTTTGTTCAAAGAACAATATTTCTCTGGAAAATGCCGTA 

GGTCCTATTGCTAGGTTTGTTCCCAATGGAACTAACATGGCTGATATTGGCATGACCGATATCATTTCTAGAACAGTCAAGGATGACGCTTCAATGATCA 
GGCTTAGGCGCGCAGAAGAGGGCGCTGGCGCAGCAGGAAAATTCATTACAGCCTCAGCCATGGGTAATTTGTACGGAGGTATTGATACCGTTGTGAACCT

AACTGAAAAACTATACGACTCGTTCGTTCTGCTCCAAGATTCAGACTCGTTCAATACACCAACAGAAATGGCCACTGCTATTATCAACCGTATGAAGTCG 

AGGAAACATAAGGCTCTCAAAACACCATTCGGGGGAGATATTGCCACCTATAAGAACTTCCCATCCTCCTCTGAAGCAATTGTAGTTAGAGCCAAGGAAA 

TGCGTAACTCTATTAGCACTATCGTGATGGACATTTCCAAGTCAAGGGGAATCAACTCGTTCTCATCCCGCAGTGGTTCTACTTTGGCCAAGATTTCCAC 

ATCTGAATTTGAAAGGATACTAGAAACATCTGCTGTTCTTTCAAATACAAAGGCCAATCTGAGAACCATTGAGAATAGACTCGCCGAAC^CTACAACAAA 

CTAAAACAATTCAGCCATATTAGTAATGATGGACTTTCCGAGACACGCGCAGTCGTTGCCGTAATTGCTGAATCTTTAACCCCCGTGTATGCGGATGACA 

CCAGCGAGAGAGGAGCATCTGTTAGTGAACTATTGACAGACAATACTCTCCTCAAATTTATTGTTCAAAATGAACTGAAAAACATTGAAGAGGCAAAACG 

TCACGTGACCGCCGCAATTGAAGGTTCATCCCAACTGCACGAAAAAATGTTGAGCCTGCTCGTTGCCTCAGCCGACATCAACCGTATGTCCGCCCAAAAT 

AACCTCGAATGTAAGAAATTGACTGAAGGAAATAGTAACTTTGTACCAATGACTAACGACCAAGGTGGTACATTCATAAAGCACAAAGAAACAGGrATCT 

GGCTGAAGACCGATGAAGAAAATAACACCAGTTCTATCAAGGACAATGATCAGCGTAGAGTAGCTAAAACCATCCTCGCAATTGTAGAGGACAATAGAAA 

TGCAACCATCCGTTCTCGTCTACAGTCTCTTTGCTTTGGAAAATATGCCATGAACGACATTTTTGCACI^GATGATGCCGATATTAAGAATATGGACAAA 

CTCATTGAAAAACTAGGCGAAGCACTCGCAGAGAAGGCATCTCCTTCTAGCTCGGCCATTTCTTCTTCCTCATCATCTAACACAACATCCTCCTCTTCTT 

CTCCCAGTTCTTCCCCATCATCATCATCTTCCTCTTTCTCAATGGACTATTCAAACAACCTTGCCAAAACTATCCCCTACATGCCTATCGTCTTCCAAAA 

CAAACAATCTAATGTCAATTCTTCTGACGCATCATCCTCATCACCATCATCATCTTCTTCTTCATCTGCCAATATTGATAATGTTGAGCACAAAAAAGTG

GCTCTCCAACAACTTCAAACACAAGAATCTAACGATTTGAGTAACGTACTTTCTGTTACCACCAAGCACAGATTTGCGTCTCATAATCAAGCTGCAACTG 

TTGGCATCTTCAACGGAAGGCAACACGCAGAGACAGTTGTTGCTATACCAAATGCAAACAAGGCTAATAATAATGCCACCGTTTCCGCAGGCCAAGGAAT 

TCTTACCCGCTTCTCAGCCCCTGAAAATGTTTCCTCCACCAGCATGCAATTGCCTCCATCATCATCATCATCATCAAATGGAGATGATAATAAGGTACCA 

GTAACTGTCAGGCTTAACCAGTACGCCAACTCAArCTTATCATCTATTGAAAACGCATCAGAATTTAAGGACTTGAAGGAAGCAGAAAGGAAAATCGATC 

TGGCCATCCAGGCAGCTTCCACCACAGAAACAAAGGAAATGGTCACCGTGTCTAAGTGCCCCTCTGCTAACCAGACTGCCATCACTGCCATCTCTCAAGC 

TAAATCCCTTAAGAAAAGTGCCCTCGAATTATTGGAAAGAGTTATCAAGGCAGTCGAGGTTTACACCCCAGATTCATCTATTGCAGCCGTTTCTCTTCCC 

GTTAATGGAGATTCTATGGTTTCTTCTTCCTCGGGATCAGGATCTGCTCCCTCTTCATCATCCTCCTCCTCATCCTCATCCTCXTCCTCTAATGTGACAG 

ACTATTTCAACTATGCTTACGGAAAATTGAAGAACATTGATGAAAATACTGAAGAAGGGGCAGAAACTGTCCAGAAAAACATGGTCGAACAAGATGCTGC 

CGTTCGCATCCCTCTTCTAGTATCATATGCTCCATTCAGCGAAATGATGAGACGTGCTATTGACAAGTTGAACGAATACTACCAACTGATTGATGCCATC 

AAAACAAAGATCGTGTCAGACACTAAAC^GGCTTCCTCATGGGCCATCAAGGAAACGGACAAGGAGCTTGATATGGACAAAGAACAGGT 

TTAATAACTTGCAACAAAACTTTTCAAACGAATCAGACAAGATAAAGATGGCTATTAGTGTTTTGGACAACAAAAGGAACGAATTAGAGCTTCAGAACAA 

CAAAACTAGK3AGCTTTATTGAAACTACAAAGAGCCGTATCGAGGCTGGTGGAGGAGATGTAGCAAACTTCAAGGAGATTATCGATTACGAAAACACATCT 

GAAAATGACAACAATCTCTTCCAGAGCCTGAAAGCATTCGCTGCTGATAACTCGGGGACAGTTTACACCCCCACTGACATGAGCAATGGAAGAGACACAA 

AATCAGACAGTAAATTTGTCGACATGTACAACAAACAGATTCTCGAGGGAGGAATCAAACTCATCAATGAGGGACAAAATACTGTAAAGGTAGACTTTTC 

AAAGGCTTTGGAGGCTTTCCCTAGACAATCCAACGGTGCTTCAGAGCCTGTATCTTCTTCAGTTGTGGAGAGGAGACAGCGAGAACGTCTTCAGGCTGTC 

GAGATGTTTATGGCAATAATGATGGAGCGCACCGAGAGTCTGAGGAAGAGGTTGGCAGATTCGGCTGCTCAGTGGAATACTGTTAATAATGTAGAAGAAA 

CTGTTAATAGTGGTATGGTTAACATCAAGAGTGAAAGGCTCACAGAGATTAGGAATCAAGCACAAATCGCTGAAAGCACTGCACTAAACTCCATCAACGA 

CGAGATTGTAGAGTCTCCTCTCACCCTCTCTrTGGGAGCACGAGTCGACCAGCTCTTGATCAAGGTAGATAGAGTAGGAAGTATCCAACAGCAGCAACAG 

CAACAGCAGCAACAGCAGCAGCTTCCCAAATrGACAGCTACAGAACAGAGAAAGGAACAACAATACGCTGCAGATAGGGTTGTTTACGATCCTTCATACA 

CCTGCTTCCTGCAACCTCTTCACGAGACAATTAAACGTATTTCTTCTGTCTATAATTCAAAGAACAAGOGTCCTCTCAGTAACACACGTGGTGTTCCCAC 

TAGCGATGCCGACTTGCAACTGATGACCATCACTGACTTGTCTAGGTCTGTACTCGACTCTTCTTCCACTTCCTCCAAGAAAATGCTGTACGAAAATGTT 

CCCTCATCAATTGTTCCTGGACTCTGCCAGCAATGCGCAATGATGATCACCAACGTCCACGAAGCCACTCATACTTCTCCTCATTCATTCAATTTCGAGA 

ACAAAAGATCCCTGAAGCAGCTGACAGAAATGTTGAACGCTGCCACTTCATCCAGTGACGGTCCTGCCGTGAGACACGATGTACTAACAATGTTAGAGTC 

CAACAATGGTTACGTCAAAGATTTTGGATTCACTCACCGCCAAAAG^ 

GTTGCACCTAATACTGTTATCCTTCCTACTTCTGAGTTGTTTAACTGCCCAGGAGTTGAAAATGACAAATTTAGATCCATGGTTAACAGGACAACCGACA 
AGAATGTGGCTGACGCACCCAAGTCATCTGCAAGCATCGTGGAGACTCTTGCTCGCACGTCTCCCAACGCCGAGCACCTTTACTTCCCCTTCAAGGACCA 
GAGGCGACACTTCAACTCCATCACCGACGCCATCATTTCTGGTATGAGCGGCGAATCTTCATCTCAATTGAACACTACTTGTGATCAAAATCTGGTAAAC 
ATTGATCAAACTACTGGCTTCCCAGTGTTTACAGGAAGAAAGCAGGGCGAAAGAAGGATTGTGCACACTGAAAACACTATGGAAGGAGCTCGCAAGGACA 
AGAACAGTG^ATCCCTTCATGTACAAAGGACCGTCAAACTTATATCGATATGGGCACAAAATTCATGGTTGCTCCAGGCTCTCTTCTGAATGCTAACAA 
GGAAGAAACTCTCCGTCTAAACAGGCTTTCAGACATTAACAACGTGAGACATTATGGCACTGATGTTCATGTGGCAGGCGCAAACTCTGCATGGAGAATT 
GGTGAGGTGGTGAGAGCCGCCTCCTCATTCCCTGACGGAGATAAGGAATCGGCTATGAAAAAGATGCTTCTTCTAGGATCTGTATCTGCCArCTCTGCTC 
AAAAATCTGCCAGTCACATTAACGATCCTACTGCTTTGTTGAGCACCAACACGTCTATCCAGAATCTGGTCAAGGAAGCTTTCCCAGACCCTGTTTGTTC 
CTCTAATTACTTGGGGTCTGCTGAATCTACGTTCGCCACTCAACTCGCCTACCGCCAGCGCCTGTTCCCTAACGGAGATGACGAAAACGTTACAACCGTT 
TCAAATATCTGCCCTATGGATTTGATGGGAAGTACAAAGCGC 

GGTCAAATTGTGAAAATCTACTGAAATCTGCCATGTCTAACGTTCCTGCTATCAACACTGCCTTTGGAGCCTTTGAAGAAGCTTCATCTTCTGTCAGGAA 
TAGGCTTTCTCCCCTTTATGAAGACAGCACCAAATATTCGTCCAACCAACTTGCTGTACAGGCCATGACCGATACTGCTGTGGATGCTTTGTCCGCTGTT 
TCTACTGTTGTCGGTCGCCAGAATGGCAGAAATACTCTTCTTTCTCTCCCTACTTCTATTACTTCTATCGCAACCAGTGGCCGTCCATCACTCTCTTATT 
CTTCGGACATGAAATCTAACCTCATCAAGACAATTTCCCGCATCAATAGAGACGCTAGCCTCCTGTCAATGGGAGACAGCCAAGTAGCTGCAGGTTCTTC 
CTTCTTTAACTCTTTCCTTCGTTCTTCTTCCATCCCTGTCACCACCAGCCAGGATGGAAATGTTGCAGCAGCGGAAATTGTTCTGGGGACTATTCTCGAC 
AAGACTGTGGAGATCAACAAGAGATTCGAGATGCTTGGAGGAGGAAAAATGGTCGCCGGGAGTCCTGAAGCTCGTGCCATCCAGCGCAATACAATGTCCT 
CTATTCTCCAGATGAACGAAAATGAACTCGCTCGTGACTTGTGCGAAATTGAAAATAAAATTGAGACTAGGCAACTGAGGGATGCTTTCCAGGATTTGAA 
GAGGTCTATGCTGATGACTCCAGGAGGCGTGGGAGCCATTTCTTCTGGAGCAAGTACCAACAATGTTCCCCTTTCTCTTCTCATGTCACGTGTCGATGCA 
TCCAGCGGTCTTCrGATGAACAACAACAGTGCCAATGTAATGGAAGCTGTGGATAGTTTCAATACTACTCCTrTGCTCGTTAGGCACATGATGTTGGATA 
GTGGAAAGTCCCCCGTTCCCATGGCCAAGGAAATTAGGAGCATGCTGACCCAACCAAGAGCTCTCACCGCCCGCGCTCTACTGAGCGAATCTTCCCCTCT 
TCTCACTGAAATCTGCCTCTACAACACCCGCGACACTCAACCAGAAAGGGCAGTCGACAGACTACTAACTTCAGCCTATCTAGTAAAACAAGCTAAAAGA 
TTCGACGGAGTTGACCCAGCCTTCCCTGCCGCCCTCACCTGCGCTTCTCACCTCATGCTTTCTTCCATGGATTCCCATACAAAGTCATCTTTCATGGACA 
ACATCAAATTGCACATGACTGATACTCAATGCTTCTTCAAGAACATTGAACGATTTGAGAAATTCTTGGGAAGATATGGGGACGAATACGCCATGTCCCA 
CAAGCAAAATTGTAACTGCCCCTTCCATCTCCACCACACTTTTACTCCCTCAGATAACGAGCATCTGGTATCCrCTTTCGCATTCGCCCGCCCAGAAGrC 
TCCATGGAAGAAATTAGAGCCACACCCTATCAGGCCAACAAGCTTATTAGTGACAAACATTACGTGATGAACArGTCCAAGATCGATTCTAGAGTAACAG 
GATCTTCCCTCCTTAAGAAGGTTAGCGAATGGACTGAAATGAGAATGAACTCCAACTTTAATGGAACATTTGAACCATCAAGACTCGCCCTCTCCAACTC 
TGGCATGACAACGGCAGGAGTCAACCTCGACGTTATTGTCAAACCAAATAATGCAAGAAGTGTACTAGGAATATTGGAATGTCATCGCCAGCACGTGTGC 
ACCGCCGACGCCAAGGGAACTGTCGCTTCAGCCATGCCAGCCGTCTTCCAGGCAACCGATGGAAACGGTAACGAATCTGAACTGATCCAGAATGCTCTGC 
C AAGG AAC AG AT AC ATC C AA AAG AGC ACAATG AACGCTC AAACTGTCGTGTTrGC T AATGTTTT GG AACAAC T T ATC GC C GATC TTGGAAAGGTT ATCGT 
GAACGAACTGGCCGGCACCATCGCTGAATCTGTACCAGAAAGCGTATATGAAAACACCAAGGAAATGATTGATAGACTAGGCTCTGACGACCTCTTCAAA 
TCTAATAATAATGGAGGAGTAGAATCAATGGATTATGAAGATAGCGAAACAACATCCAACAATGGTCCCGTCCTCATCTCAGAAGCCATGAAGAATGCCG 
TCTATCACACACTAATTTCCGGCAAGGCAGCTCGCCCGGAAAATGTACCATTCGCCTCATGCGCCAGCGGCCCTCTCGCCTTTGATTTCCTTCTGTCAAA 
GGGAGATACATTCGAAGAAAAGAACGCCGAACAAGGTGCAGCAGCTGCCGTATCCTCTACCTATTCTTCCTCTTCTAACACTACTCTTCGTAAGCATTTG 
GCTCGAGTTTTCGAAGCCATCTCTAAGCAAGTAACTGATCCTGAATTCAAGGAT^^ 

GTCCACCAAATACTAACCAAAATGCCTTTGCTCTAGCTATCAAGAGAGAATTCAGCAGAATTGTTTCCTTCTTAACCATTCTTCGTAAGAACATTACACC 

CGCATTAGTCGACCCTAAAGGCGCGTTACACGAGAAAGTAGCCATCTATTTGACCCTTCTTTCAACCAAATCAAAACTAGAAAACTTTTTCCAATACGGT

CTCAGTAATTCGTCCTCAGTTGATCTTAGCCATCTAAAACCCATTAATTGTAGCAACAATGTCAAGAATATTGAAGACACATTCATGTACAGAAATGTCC 

ACCCTATTCTTATTATGGCCCTCCCAGAAAATTTCACAGCTCTCTTGCAACAGGAACAAATGGACCCCGATACTGCCATTGAAAGCAGACGCTCCCTTAC 

CACCT/TCCTTAATCATCCCAACACTGCTTCAATGGCGAACGGTGCAAGAGCCGCTGTGGGTGCAGGAGGAGGAAACCCAATGGGCTTGTATCTTTCT 

CACATTCTTCACGAGTCTACCGTCAC^ACATCAAACCCCGTCACAGACACCACAGAAAACGTCAACTATCATTCCTCTGTTACACAAGATCCTGTTATGG 
TAGTGAACCCCTTCAAGGATTCTGCTAGGTTGATCGTTAACAACAACAATACTGGAATTGATGTCTTGAATGATAAGTCGTGCAACTACTTGCAAGTATC

CATGCCATCTGAATCATCTGGCCTCGTCACCAATACTGGATGCTCTTCTTCTTCTTCCTCATCTTCGTCTGATACCTTCAAGTACGTCAGGAGAGACAAT 

ACGCCTGTGAATCTTCCCCGTGTCACACCAGCCGTTCTCTGTTCTGATGCTTCCTCTAATCTCTTGGACGTGTTCTCCAGGGCAGATATTGTCCTCGAAA 

ACATGAACGTGAGATTTGGTTTCATGCCCGAGATTATTGCTGCCGTCTCCAAATTCJUVGGGGCTGACCAAGGAAGAGGTTATTAAGCAAATGGT^ 

GAACAACATCAACAACAACAGCAACAACAACAACGGAAATGGGAAGAAAACAACCGTCGATCCAGTCACTGGGGATATTGTTATCACCAATGCCACATTC

CCCGACACTCGTCCTCTATACACTGCAGCAAATGGAGGAACATCATCATTCAAATGGGGAGAT^^ 

CCTTCTTTATTGGTAACCCAACCGCCGCCGCAACAGCTAACGGAGTGCCTCTTACATCTGAGGGAATTTCCCTCACTGAAGAAAAACGCAAGAAAATCGC 
AGGCATCTCTGAAGGATCAATTGGCACGGGGGCTCTGCGTGCAGCCGCCAACACCCGCCTCTCATCCGACATGGAACCTGTCATGAAGGGATGGAACAAC 
ATTGTTCAGCTTCAACAAACATTCAAGAAAGCTTCAGATAAACTCACTCATCTCTT^^ 

TTATTAACAAGATGCACGACAGCTTCAAGACATTGGAGGAATGTCGTAGGGTGATCCAAGACGAGGCTGCTCTGCTCGTTGCCACCAGCGATCTTTTGAC 
CGGTGGCTACGGAGGAGATGCTGCTCTGGCCATGGTTTCTCCAGTACGTCCAGAAATGACTGGTCTTATTGGTGCAATCTCCGCGCCAGTTAGAGGTATT 
AGCCACTTGTTGAAACTGGGAGGTGTTTCTGCTGCTAACGCAGCTATCCGCAAGCGCCTCAACCTACCTACATCCAACGGGAAAACACTACCAGAACATG 
GAATCGTACACAAATCAGCCAAGACACTTTTGCTTGATTCAGACTCTATTAGCAACCTATACAACACTGATCTTCAAGACGTTGTCTCTAACGCTAGGGA 
TAACAACAATTTGGGAAGAATTATGCAATCTTTGGGACTTAAGGGGAATAATGCAGGGGATTTGGTTTATTCTGCTAGACAACTGACGGACCTTATTACT 
GTACCAGAATATGGAAACAATCGCGATCTTACCAAGCGTCAAGCTATCCTTAAAATGCTCATTTCTAACCCTGAAATTCTAGAAAATGTTGCAGATACCA 
TTTACCTTACAACAGGTAAAAATGCTCTCGCACCGGTATCTGCTCAGGAAATGGCTTGTGCGTCTCTGACAGTCGGAGGAAGTGGAGGAGGAAAACTGTC 
ATCAGACGACAATGTTCAATCCCTGAACCGCCTTTATTTTCGGGTCTAGACTAGTGTAATATTGTAT^^ 

TGTATGTGATTTTTTCGGACAAATAAAAGAATTGGAATAAAAGACTTTGTATTATTTACCAAATTTATTGATTTTTTTTAAACACCTTTTTATGCCACGG

GGTTGATTGTAGCATTTCCTCCCCCTGGAGATTTCATAGTGGGACATCCAAAGAGTGTAACTCCTAGTGCTGATACACCCAAACCTAGGAGGAATCCGGT 

AAGTGCAGGAGTAACAGAAAGCATGAAAGCGACCATGCTCAATGCGATGACGACAACGGCCAAGCCGACGATGATGTTTCTGGTCATATCAGACATGATT 

TCTGGTATCTGTTTATAGGGTTTCACCCAGAAGTGTTTTAAACCCAAAAGTATAAATTTTATTCTATACTGCTCATCTCACTCTTCCGACCTCACAAAGG 

AGAAAAGAGTCAGTCTTGCTCCAGCCACCACCGAGGCAGGAGCTCCTCGGACGCTCCTGTATTTTTGTCCAGAAATGGATAAAGTTTGTGTTATATCAAA 

TACTAGGGAGCGCACGTTTAAGGTACCTGCCGATCTACTGTGCGTTGCAACAGAACCGGAAATCTCTACCAAGGAAGAAGATGCAGGTATAGAAATCGAG 

ACAAGAGTGGTGGTGTTTTCAAGGTGCGTCTCGGTCCAGGAACTACATACAATAAATCCAAATGATGAAGGATTTTCTGTCCAACTTTTCAAGGACTACC

TGAAATTGCAATCTGCACAAGGAAAAAAACCCATTGGTTTGTACATCCAAATAAAGGCTGGAGAGGATCTTGAAAGGAGATTAATCAGTGGAGGAACTGC 

ATACCTGGATCCGGCAACACACCTTTTCTATCTTGATTTCTCCCTTTACCCTAATTATTCAATATTCAATGACArTTCATCCCGTCTGAAAATCATTGAT 

GAAGACACGTACAATGGTGTTGTTTTCTCTAACAGTGAAGAAAAAGAGAAG^TGCACTAGTGCTGATAAGGCTGACTTTTTCTACGCATGAAAAGGCAA 

TTGAAGCAGCCATAAAAAAAATAATGCTAAGGAAAGTGTTTTTCAAGGATGGAGATCTTGATTTCGGGTACTTACGTATACCAAAATCTAAACTGGACAA 

ATTTACTCCCTATTTTCGGAGTCAATACGGGATAGTAAATGTTGAAAAAAATATCCCTGGTTACATATGGGGAGAAATTATGAAGCAACGAGTGCGATGT 

TCCAGATGGTACCTTTACAACACCGACTCGGAATGGGAATATAAAAATGTGGCCGAAGAAAGAGTTGGACCTCGCCAGrTAGTGAAAAAATATGGTGCCA 

AGTGTGAAAATTTATGTTTTAGGGACATAGACCTCAGAAAAAAGGAAGCAAAGGAAAAAAGGGATATAGAAAGAGAAACTGAAAGCAGATATGTGGTCGT 

AACACTAACCCATAAGCATGAAATGCCTGAAAATATGCCCTATTTTGGACCAAAGTGTTCAGTGGTGAGGTTG^ATGAAACTAGAATACTTTTATGTTTT 

GTGGATGAAATTTCTTATAATGATGAAGATGTAGACGAAATTTTGTCTGAGAATAGATCACTAAGAAATGTTTCTATTAGACATAAGGAAAATGTACCTG 

TACACACGTTATTAAAAAAAGGTGTGTCTATTCATGCTAGATTTACCCTTAATGGTTTGGATGATGCTTTAATAATTTTAAAGAGAATACCAAAAACTTA 

TTTTGAAGATGAGGAACTACAAGCCGCTTGTGCGCATGTTAACCTTGAACAGTACGAATGGCTTTGTTCTAATAATAGAGGGAATAAAGTAGAACATGTA 

AAGTCGCGGGTAGTGACTCGAGCAGTTAAGCGTAGGAGAAAATGTAGACACTGGATTTATTTTGATAAAGACACTTTAAATTTAAACTACAAATACTTTG

ATAAAAAAGTTACTGCTAGTATGGCATCTAAAATATGTAATGCAAAACACGACTGTTTAGTTTTCCATAGAAAAATGGAATTGGAAGATTTGACTGAGAG 

CGCATATTTCAAGGTAGAACCTTCCCCAATAAATTTTGCCAAGTTAAAATCTTGCCCGGATGTTAAATATGTGCAGAAAAAAACAGATGGTACATTTTCT 

GTTATAAGATTCTTTAGAAACATGACAAAGGGTGATCTTATTCAAAGGATGGATCTTTTTTGTAGGTTTATTCCCGACTCACACACTATTACACTTTTGA 

GTAGGGCGGATTTTTATGCATGTAAAAGAGGAGAATCTATGCATATGTGCACAAACAAACACCGTATTCTTCACTACAAATTCTCCAACGCT 

GGCCATCGAACAAATAACCAATATCATCAGTGATACAAGAGGACGTAAGGGTATACAC^TAGAATACGCGATCGAAAATGTACAAGAAATGTACGAAGAA 

GATGGAAGAAGATATGAAGCTAAATACACTGGAACTTTAACCGAGTACAAAAGAAATGAGGACAAAACCTTCAAATC^ 

CTGTCAACAAACCATATAATATTAACCATTTGTATGAGCAATATGGAAATTTTGATGAAGAATTAGAAGACAAGTTGAGGAGTGGTTTCATTTCTTATGA 
CACGTATGTTACTGCAAAAGACAACTGGGGCAGGTGTGCAACTGGAAAGGGGGCGTGCATCTAGGACCCATAACTCAAAGTCAATACTGGTATGTGTTTT
TTGAGAAGGTGTC^GAATGTGCACATTAAAAACATACAAAATGACTACTTCAACAGAAATATCAAAGAACCTTTCAGATGTGTTATCCATCAAGGCAACT 
GGAGATTGGTGCAGTAATATCAAGACGGTATTTTCACCCTTCACAGAAGGCAAGGGAAATTTACCAAACAGTCTCCCGTTTACGAGAAGTCCCAATACAA 
CATGTGGTTCAAGAGAGGCGGCAAACGCCACAGAGCATTTTATCACCGTCT^GCAAAGGACAAATATGAGCGGAAAAGAGTAAAACGTACAATCGGATT 
CACCCTCGACAACACAAAGGAGTTGACGCCCAACAGATACTTGGTAGCAGATCTATACTCTTGG 

CCACCAGGAAAGTCGGGAACATTTGTACGCTACTCTAATGAAGATAAAAGTTTTCTACTAGCAGATACCGGAAGATATATGAAAAAGAAGTACGATGATC 

CAGAAAATAAGACCAGTAGTGGGGGTGATGATGACGATGACGACGATGATGATGATGACGACAACAACAATGTTGACGTGTATGAAGAAAACGACCCCAG 

AAATGTATTCGAGGTCGAAAAGGATGAAAAATATGCCTGTACTTTTTCAAT^TTGGTCTATAGAGCAATGAAAAAGTCTCCTCCTGTATGTAGAGGGTTA 

TTAGTAGAGACAGATGGACCCTCATCTCACCCTAAACGGGCCCCGTCAGCATTTAATCCATTCGGAGGAAGTTCTATGTTGAACGGTTATGGTGCAGGTG 

CAGATGCACTAGAAGAAGAGGATGAAGTTGATGGAGTTCCTGAAAGAGAGAGGATTACAAATTTTGCTCTCAAGAGAGGACCTGCAACTGGCCAGAACTT 

TGTATCTGTTAAACTGGAACATGATGGATCTAAAGCAGACCTGTACAACGTCACGTGCTTCTCCAAGCAGCGTGGAGTATAAAAGGCGGGACGCATCAGC 

AAATGCAACTCATTCTTTCTCATCATCTAACCATGGCTGGTCGTGTAGAGCTCGTCACTGGACCCATGTTTGCGGGCAAGTCTACCTACCTGAAAAACAT

ATACCAACAAGAAAATGGAGGCAATAAACATTGCCTGTTTGTCAAACACTCCCTAGAAACTAGGTACGGTTGTGGAACTGGAACAATAGTCACT 

GGAGAAGTGATTGAAGGTTGTACTACAGTTTCTTCTATCAAGGAACTAATCAGTGTGTTACCAGAAGTTGTGGATGTGATTCTCATTGACGAAGGGCAAT 

TCTTCACGGATTTGGTGCTAGTCAATAGACTGGCTGACAAGGGGAAAAGGATTGTGATTGCAGCACTTGATGGAACTTCTGACCAGCAAATGTTCAGTCC

TATTCATAAGCTATTGCCTTATACAAATTCCATTGTTAAGCTAGCATCTAAATGTATGATTTGTAAAATTGATACCAAAGAAGCTCCTTTTACTGTAAGG 

TTTGGTAATGACAATGATAATAATGTTATATGTGTAGGAGGAGCTGAAATGTACGCTGCTGCCTGCCGGGACTGTTACAAAAAAATTAACAAGAAAAAGA 

ACAAGGGGAAACTTGTTGTACTTGAAGGAGGTGACAGGTGCGGTAAGAGTACCCAAGCCAAACTCTTGTTGACCAATAAAAACTCGCCTCTTTATGGAGG 

AGAATATATGTGCTTTCCCGACAGGAGCAGCCATACGGGTAAACTCATCAATGATTATTTAACTAAGAAAATTGAACTAGATGATCATGCAGCTCACTTG

TTATTTTCTGCAAATAGATGGGAAGTTTGTAGTAAAATTAAGCAGTTGTTAGACGATGGAATCCATGTTGTGATGGATAGATATTACTACTCGGGGATTG 

TTTTCTCTTTAGCTAGAGGAGTGGATACCGTTGAGTGGTGCTCTGCTAGCGATGAGGGACTTCCTC^GCCCGATCTTGTATTGTTGATGCTTTTAGATGT 

TGAAAAGTGTTCAAATAGGGATACTTTTGGTGTCGAAAGATTTGAGACAAATTCCATTC 

GACGAAAAGAATGTATGGATTAAGGTAGACGCTCGCGGCACCATTGAGGAGGTGCAAACTAAAATTATAAATATTGTATATAATATTGTTGAAGAATAAA 
GAATTGTAAACTGTTTCCATGATGTGTTTCTTTCCTAGTGATTTTCACATGTTACCTAGAAAGACTTTGCCCGACACTGAAAATGGTTATTTTGTCTTGG 
ACGAGTCTCTTCTGGAGAAGGTGTACTATGATAACAACAATGAACTGATTGTAAGAGTTGGTGGGATTTATATGCAGATATGCAAGTCAAAATACATCTT 
CCATCACGATGATCCAGAGAGGTTCTTTTATAGTGTGTTGGAGGATTATCACCCCATCAAAGAGATTGTTGAACGACTAGCAGAAGAGGATGGGGTATTT 
TTAGGACCGTGGGAGTTTTTATCGCGCAAACAAGTGAACCTCCAACACGGGTGCTACAAAGCTCTTTTGTCATTGCCAGAGGACAAATATTGTAACCTAT 
TATTACCCCAGCAAATGAAAACCAACCTGGAAAAAATGGAAGAAATACAGCGTACTAGACTCATTCACTCTAGAACGTACAATACACCCCAGATAGAATT 
GTCTGACCAGCTAGATGGATGTGTTATATGTTAACACTTTTTCAGTACCATGTAATATTTTTGATATCAAATTGAAAGGAAAATGGTTCAGAAATTGGTA 
GTAAAAACTTTATTGTTTCTTTCCTCTTTCTTTTGTCTGTCACTTTATTCGTCAGACATAACAACCATAATACTGAGTATGGGTCTTGGATCTTTGTTGG 
GACATTTTGGGTGTGTTAAAAGTATCACAGGAAAAATCGTCTCAAGAAAGGCAATGGATAATTATAGGCTCGTGTCGGTCCTCGGCTCTGTATGTGGAGC 
AGGAATTTTCTGTGGATACCAGCAATATTCTACACCTCAGTCTGCACCCATGGCGTTGCTGTCTGTCTTCTGCTGCACTGCTACTGCTGCTACATCAACT 
ATTGTAGGATTCTGGATATCGATGATTATCCAGGCATTGAAAAGGAGTAATAAAGATATAATATAAAATTTGTGTGTTTTATTTGTTTGATAATACAATT 
TTCACCTTGTACCGAAACCCATTCTCTACAAGCACTGTGTCTTTGGACACGGGGATCAATATCTTGGATTTTGGATGAATGTTCCTGACCCTCATATCTT
CTTCATTTTCAAATACGAGGAGGGATGTTTTCTCAGTATTCCTGTTGTGTATTTGGAGGTGAACAGGGAAGGAGAATAGTTGATTGTTCTTTGGTGTTAG 
ATGGAACTCGTATACCATGAATTGTACTCCTCCAATATCTTCAAGATAGATGCATGATGTATTTTTGTCATACCATCTCGTATGGTCGTCAGTGCTCAGG

TACAATCTCTTGTGGTGGTGGTTACACATCTTTCCACTGGTGGGAGAGGAAGTATTATGGTTCAATATTGATGGAGGATGGATTTTTATACCCTCATGCG 

AATGCGATGTGGAAAATTTTATATGTTTCCGACAAGGGGAAAAATATGAATAAACTGAGCATACACATOTCGTATCCAGATATCTAGGTCGGGAAGAATC 

CAGAAACGATTCAAACCATTTCTGGAACATTCATTTCTGGAAAGGGTTACATTTTATTGTATCTGGTGCATTATTTCTGGTACCATTTTTGGCACCTCTG 

TGTAATCTGACATTGGGTCGACCCAGAGGTCCACCCTCCGAACTTGACATTAGGCCGACCCAGCGGTCCACCCTCTAAACTCGAGTGAGCTGAAAAAATT 

TTTATAAAATTTTTGATGAGGAAAAATAGAAGTAAGATGCTTGTCAATGCATGCTCCCTGTGGAAGGTGGTGGATGAGATAGGAGGGAGAGGCTATCCAG 

ATATCTAGGTCGGGAAGAATCCAGAAACGATTCTAGCCATTTCTGGAACAATCATTTCTGGAAAGGGTTACAArTTATTGTATCTGGTGCATTATTTCTG 

GTACCATTTCTGGCACCTCTGTGTAATCTGACArTGGGTCGACCCAGAGGTCCACCCTCCGAACTTGACATTAGGCCGACCCAGCGGTCCACCCTCTAAA 

CTCGAGTGAGCAGAAAAATTTTTAAAAATTTTTTGAGATGGAGATGGAGTGAAACTTCCTAGGTGTAAGGTGCACTACAGAGGCTTGTGATCCTTTCTGG 

GCGGCTCACTCTCTAGAAACGTCTGCTCCAGAAACACATAAAAAAGATAAAGTACATTTCTGGACCACTCCCATTTCTGGTGTAATCTGACATTGGGTCG 

ACCCAGCGGTCCACCCTCGGAACTTGACATCGGGCCTACCCAGCGGTCCACCCTCTAAACTCGAGTGAGCTGAAAAAATTTTTATAAAATTTTTGATGAG 

GAAAAACACTACCAGTGGAGCGACCGTACACACTCCTCTAGCTAACTACTAACCGAGAGAGTGAGGCAGAGTCTGTCCAGATATCTAGGTCGGGAAGAAT 

CCAGAAACGATTCAAACCATTTCTGGAACAATCATTTCTGGAAAGGGTTACAATTTATTGTATCTGGTGCATTATTTCTGGTACCATTTCTGGCACCTCT 

GTGTAATCTGACATTGGCCTGACCCAGCGGTCCACCCTCCGAACTTGACATCAGGCCTAGCCAGCGGTCCACCCTCTAAACTCGAGTGACGCAGAAAAAT 

TTTTTAAAAATTTTTGATGAGGGAATATAGTACTCCGTAGCCAACATATACACATGAACACATGAGGCGGTCTACAACAGAAAGAGAACTGATAGCTGTT 

CCAGATATCTGGGTCGGCCAGAACCCAGAAACGTTTCAACTCATTTCTGGACAAGCCATTTCTGGAAAGGGGTACAATTTCTTATAACTGGTATATCATT 

TCTGGTATAATTTCTGGCACCTTCGTGCAATCTGACATTGGGTCGACCCAGCGATCCACCCTCCGAACTTGACATCGGGCCTACCTAGCGGTCCACCCTC 

TAAACTCGAGTGACGCAGAAAAATTTTTTAAAAATTTTTGATGAGGGAATATAGTACTCCGTAGCCAACATATACACATGAACACATGAGGCGGTCTACA 

ACAGAAAGAGAACTGATAGCTGTTCCAGATATCTGGGTCGGCCAGAACCCAGAAACGTTTCAACTCATTTCTGGACAAGCCATTTCTGGAAAGGGGTACA 

ATTTCTTATAACTGGTATATCATTTCTGGTATAATTTCTGGCACCTTCGTGCAATCTGACATTGGGTCGACCCAGCGATCCACCCTCCGAACTTGACATC 

GGGCCTACCTAGCGGTCCACCCTCTAAACTCGAGTGAGCTGAAAAAATTTTTGAAAAATTTTTGATGAAGGAATATAGTAGTATAGTGCCCGCCAAAGCA 

TACACACACTCGCCCATGCTCGTCTAGCTGATAGCTGTTCCAGATATCTGGGTCGGCCAGAACCCAGAAACGTTTCAACTCATTTCTGGACAAGTCATTT 

CTGGAAAGGGGGTACAATTTCTTATAACTGGTATATTATTTCTGGTATAATTTCAGGCACCTTCGTGCAATCTGACATTGGGTCGACCCAGCGGrcCACC 

CTCCGAACTTGACATCGGGCCTACCCAGCGGTCCACCCTCTAAACTCGAGTGACGCAGAAi^TTTTTGAAAAATTTTTGATGAAGAGATTGAGTAAAAT 

TTCTTGACGATAAGAGGAGGCAGTAGGTGAGGCTGCTTGTTTGATGTGTCAGCCACATCTGCGTCATACATTATATTTCCAAGAATTTTGCTGACGTCAA 

TGGACCATAAAAGGCTTTGTACGTCCAGAGACAAGTTTTAGTCTGATAGATTTCTTAAAAAAAAGAGGGTGGGAGGTTTGTTTTTGTGGGTTCTGTGTGT 

GTATAAAAGATAGGTGCAAAGGTAGAGAATCATCATATGGACAAAATCTGTCCATGAGACCTCAGTAGAGAACGCACCATGGTTGCTTCAACTCCGTGTC 

CAGGCCCAGGACCAGTTCCAACCCAAGAACTTCTTTCTACAAACTTTCTTGAAGCTCACAAGCTTGTCGTGGAACTTCTTCTCCCGTCCTACAGTAGTGA 

TGTAGTTTATTGTGACTCTGAGACGTACACCAAACCTATACCGATTTTTGGGAACAAGAGTATAGTTTCTACCATTGGAGACTATGTCTTATCAAACCCC 

AATGAAGATGTGAGTTACCAAATGGTTTCTTCCGTCTTAGAAAAATTTCCCTTGCTATTCCACTGCACTTATAAGACGAATGAAGAAGATAAAGGTATTC 

CTCTGTGGAAGAAGTTGTACAACAAAAGAAAATTCAAACTCCTCAACTCATTGTTGGTTCATAACAACAAGAACTGGACTCCTGTTCCAGCTATCCCGTT 

TGACAGGGAGAATATATGTGATGCTTCAGGAAGGAGTGTTCTTATGAGTGAAATAATGTCCACGTCAACTTTTCAGACAATTTGCAAAAACAACACACAT 

TACTTGTTTGATATGTTAAATATGGAACGTGGCAAACAAGGAGGGAGTTTTCTTCACTTCTTTGCATCTAGGAAGAATTCTTTTACTAACTTTGAAAATG 

AAGAAATGGACTCTCATGTGCTCAGTAACATAGCGAAATTCATATGCAATGAAAAGGAAAAACTAGACTCTTTCATACCTGCCAACGGAAAAATACCATG 

CCCTGATAAAACTAATGATGAAGGGTACATCCCGCTGGAAATAGCAATTATGGAAGACAATTACCCTGCATTGCTATATCTCGTTTGTAGGTATGGAGCA 

TCTTGGGCAAACACATACGGGGATCATAATGAATCTCTCAAAGCGTTTGCAATAAGAAATGATGCAAAAGATTGTCTGGAAATTATAGAGTTTATAAGTG 

ATCACTACAGTTTCAACAAAAATGTGACGAAGGAAGAATTTGTTAAAGAGAAGACTGTAGAATGTGTTGGATGTTTATATGATATTGAAGACGAGAAACG 

TTGTTACAAACTCCCATGTGGACATTTCATGCATACATTTTGCTTGTCTAATAAGTGTTCTAAAGCTAACTTTAGATGTGTTAAATGTTTCCAAACCTTT 

GATGACACAATTTTTAGAAAATGTCCCCCAACTATACAATGGAAAATGGGTATAAACCAAACGACTAACCATAAGGAAATGGATTTGTTCAATCGTGCAT 

TTGACACATATTTAGATTTTATTTGCTCATATAACGTCAAATTAGACAAAAAATCAAAACCTAAACACAAACCTGAAAACAAAAAGGTGGAAGAAGAACT 

AGCAAAAAGGACAGCAGAAATTGAAGAGGCCATAAAGAAAAAGGAAGAAGAACTAGCAAAAAGGACAGC^ 

GAAGAACTAGCAAAAAGGACAGCAGAAATTGAAGAGGCCATGAAGAAAAAGGAAGAAGAAGAACTCTCAAAATATAATAA 

GACGACTGAATGAAGAATGTGTCAAGCTGAGAGATATTTCAACTGCAGCCATAAACATGTACAAAGAGAAAGTGAGAATTAATGGTGTATTACTAAAAGA 
TTCCGATCAGGAGTTGGCTGAGGCGAAAGAGAGGTTGAGGAAAATTTTATTGCTAGAAGAAGAAACAAAACTTGACAGATTTTTGTTTAGACCGAAACGA 
GTAGAAGAACGTATATTCCTAACTAAAGATGATGAAACGTTAGCCTTCAAGTTAGCCCTAGAAAAGAAAACGGAGGACATAATTGCGAAGAAAAACAACC 
AAAAAGGCAGTGAAAGAAGAGATGGAGAATATACTATAACTTCTCATATTGAGAAACTACCTCAATCCACTGCTTTGGCTAGTGTGTGTGTGTTAAACGA
ATAAATAAAAGTATAAAATGTAATAATATTGTTTTAATATCATTAATATTTTACCACAATGTCTACTTGTTCGAATTTGTTGTCAGTATTTGGTGGAGGA 
GATTGGACAACAACATTCCCATTCGACCTCGTCCATACACGTCAAGAGTGTGATAAAAAGAGAGAGCAAGACTACTCATTTTTCATTACTGAAACGTGTA 
AAGGAGAGAATATTGGTATACATTCGTATGAACACACGTCAAAGATTATTGACACGGGTAATAATGATTCTACCTCAATAGAGGAACTAGAAGTACTGAA 
TATATACAAAGCTATAAACCATTTAGAAAATATCCTAAAACTCAACAAAGGAGAAAAAATT^ 

AAAATTTTAATGAAAGGGATTCTTCCCAAGGGTAAAAATGGAAGTTTCAGTACATGCGTACGCTTTGCTGTAAATAAGAACAATGAACGGCATTACTACC 

CTGTATTTGAAACAGAGAAAGAAGCGTTCAATTCTATACAAAATCTAGTAGATTATTATAATGAAATTGTAGCTCACACCAATGACCAAATTAAAATAAT 

AAAAGCGTGCGCATATTTCATGTACAACTTTCTAACTCTCCACCCTTTCAATGATGGTAATGGAAGAACAGCTAGATTATTGTATAGTTTTCTATTGAAA 

GGTAATGGTATCGTACCTCATTTTTCACCCATAACACACCCTAGGGATCAATTTGTTGATACTTTAGTGTATTTTAGAGAACATGGAGATGGACGACCTT 

TATTGTATGTTTTGCTGGAATCAATAAAAAATAAGTAAAATTCATTTTGAGGCATTTATTTCTTCATCAATTCATATGGCATCTTGCCAGTTTTCTGTAT 

TGTTATCTCAGCAATAATGCGTAACACATCAGCAGATGTGCAGATTTTATCCCCTTCCCCATCCCTCATGGTATTTCCTCCTGCAAAGTAATGGAGGAAA 

GAATCTCCAGTAATGGGGTTAGTTACATTCAATACAGCAAAGAATTGGCCAATATCTTTGTCGGCCAACTTGTACAAAAGCTCCTTGGTCATGACTTCAC 

TCATAATGATAGATTTTCCAGTGGTGTCACAAATGGGGGATTTATCAAACGGCATGGCGGGGATATTGGGGTGGAATGGGTTGGAGAATAACACCAGGTC 

GCCGAGGAGGGCAAACTTCCCGTGCTTGTAAAGCTCCTTCCACAGGGGCGTATTCGTCTGCTTGTAGTTGCAATGAAGTAGGACAGGGAATCGCCTGAGT 

ACGTGGGACAGCACTTTGAAGCTCATCTTTTCGTCCGAAGAAGAAGACAGTACACTCTCTCCAACATCGGTGACAATATTCTTCCCTCCCGACTTGTTCT 

TGTACAATGGAGACTCGCAATCGGTGAACATGGCGTCGGTAGGGAGGAGCATGGTTTGCATGAACATGCTGTCCGCGTTGAAAAAGTCCTGCTCGGCAAT 

GGAGAATATGTTTTCCAATTTGCTTATAAGTGAGTCCATTGCTGCTGCTTGCTGCTTGCTGCTGTTAGGCGTCCAGAAAGCGTGTGTTAATTTCTCGGCG 

GAGGACTGTTCTTATATACAAATTCGTTAAAGGGTGACGCCCACAGATAAGAATGTGGTTAGACTTGAATCTATTTGAAACCATGACAAGGCATTTTCTT 

TTGTAAGGGGTGAATAAGGACTAGACTCCTCTACCAGAATTCATGTGCGTCATTGCAAGCAAAAACTCCCTCTTGTTCCAAGAATTGGCTAGTCG 

TGCATGCCAGTCTGTATGACATCATTTCCTTATGTGCAATGGATAATTATAATTTGTAATATTTGAAGTCGTCGGGTCCGGTCCACCCAGAAAGGGGGCG 

AATACACATAACATTTCTGGGGGTGTATTTAGTGCATTATATCTGGCATCC^GAAACGGATGCAAGCAGTATCTGAAACTCTCTCTCACTTTTGACATCA 

ATGAAAAATTGGACATCAACACGCATTGGTGTGGTGGATAAAAGGGGGCAACCGGCTATGACTAACAAGCATTTCCTGCCTAACCTGCAGCCAAGCAACA 

GCATATTACACAATGGAGGACCTAAAATCCACTATCGAGAGAGTATATGAAGAAAGAGTGGAGAATCTAGAACAATGGACAAATACTGTAGAGGAAGAAG 

AAAGGACTGTCTC^GCAATCGATTCTGTCCTGGAGGAACAAAAAAGGGCCCTGGACGCATGGGAAGCAGCGATAAAGGAACGAGAAAACGACCTCG^ 

AAAAGAAGGGATATCTGCACTCGTTTTCAACGCAGCAGACGCCAAAACACGTAAAGAATTGATAAATACGTGGATAGCCGAAAGGGAAACGTCAGAAAAA 

AGAAGAAAGGAAGCAACCTCTACCAATAATCAACTGAAGAACCAGATGTCATCTCTAGTCAACACAACCAAAACACTCAAAGAAAAGTACAACAAATATT 

ACAGAAGAAGTGCCATACTCAACATGCAATACATCAATAACAAAAGGGATTATGAAGCAAGTCAATTTTGGGTGTATACAAACAATGCATAAGTTTTCAA 

ATAAATTTTATTTTATAATAAAAGGTGTTTTAATTATAATTTTTGTCCCTGATGTTGTGTTTTCTATATTTTTACTGCCTCCTCTTGGGGTAAGACATAA 

AAACGGTGGCGGCGGAAACGAGGAACAGAAGAGCGGACCCAGCCAGAAGCATCATATCCCTGGTCCTGTTCTTATATTTGTCCTCATCATCGTTATCGTT 

GGCAGTGTCGTCATCATCATCGGTGTCCTTATCAGTGTCAGGATCGCTGTCCTTCTTTGGTCCCATCCATACATTCATGACGGCCAGGACGAGGATACCA 

ATTGAGATAGCAACGATCAAAAAGAGACCCATGCGAGCCATAGACATGGAGCCTTCAAGATCTTCCATGGTGTAAGAAGGGCCAGCGGCCTGGGCTCCGG 

TCCTGCCGAAAGGAAGAGTGTTAGTCGTGGTGGCCATTTTTTTTCTGTTTGATGAAGAGGTACGTCAATGTAACGTAAAGACCTGTTGTTTTATCGTGTA 

CGGTTTTTGTCAGGACACCTTACGAACCTATTTGTCTTTTTAGACTTACTCAGCGTTAGTTGTCGCAGTTGGTCATGTTCCGTCAATTCTGTTCACTATA 
TCTTCTTCAACGCCGGGTTAACGACAATCTTAGATCCACGGCGTCCGCTTCGGCAGCGGCCTCATTAAAAGGAGATGGAACTGAATTTATAACCGGAGAA
CCACCTTCTCATAAAATGAGGGGACCTTCTTATAGCGTTTTAGGACCTGATCCGTGCGAGGACCCAGAAAGGGrATATGTTGATATTGTAGTGTCTATTT 
TGCAGACAAATAATATACAGGTAACAAAAGAATGGGAATTGTTTTCCGATAAGTTGAGAAAATTGGGTCCATGGATTGATAGGAGCGGAATTGAGAATAA 
TGGCGAAGGAGAAGAAGATGGAGATGAAAATGAAGACGGGGGTGGAAATGGGGGAAGAATTGAAGACAGAGAAGCACATCGACGAAAAATGATGAAGAAA 
TTGTCCTTTGTTGGAAGAGAAGATCCAGTCGCTGTAGATTTACCCACGTGGCGAGAAAACAGTACAGAATTTGCACGTCGTTTAACACTCAAGGAATTGT 
GCGATTTAATAGTTGAATGTGGATGCATCAAATCAAAAGAGGAACTCTTTGACTTCATTTTTGAAGAACCGTGGGAGATTAAAGAGGCTGCTGACGTTAG 
GGGTATGGCAAACAGGAGTAAATTCACCAAGGAATCATTAATTGACTGGTTTTTTGAGTTCGACACATATAGTAAATGTGTAGTATTTTTTGAAGCAGTC 
AACTGGTACTTGAAATCTCAAGCGTCTCCAATTTCATTGGTACTAGATGATATATTGTTGTGTCTTTTCCTACATAAGACGCCAAACCTTTTTAACTA
GGGCAAAAAACCCATCTTTAACAGTGGCTTCATCCTTTTCTCCCACGCCCGACACAAAGCTTTTGGC^ 

AGACATTAATATTAGCCAGATGGCATTAACTGAAAGGGACTGCTTCTTCCCTCTTTTAACTGAAATGCCCCGCCAACAAAAAAAAGTAAACACCTTCCTG 
GACACAATGAAGAGACCTACCTTATCTCTTCTACCTTCCACCTCCTCCTCCTCTTCTTCC^ 

TTCTTCCAGTGTACAGGAGTAACTTTTCTACAGCATCCAATAACAAGAGACTGAAAACTGATGATGGGGAAAATGCATCAGCCTGTATTCTTATCGAAGG 

GTATGCGAATGGAAAAATAAGCCCTATAAGGATTATGGTAAGAAAATCAACTATTATTCCAGAAGTC^ 

AAAGACACTGGTGCGAATATCTTATTTTTTATCAAAATGAAATCCTTTGCAA 

TCAACGGGCCGTGCAAATGGATGACTCTAGCAGAAAACAACATCAACGACAACAACATAAACTCTTCCACGATGTGGAGTTACACGCTAGCAGATTATTG 
TCCTCTGGGCTATTACACCCAAGAGAGCCCTCAACCCTATCAGACATGCGGCAATTTTACTTCGACTAa^CAAGAGACTACAAAACGTGCAGCCATTA 
TACTTTTAAACACTCTTTTGGAATACTACAGSACACCTTCAGAAGAGTGGGAAATTCCGTTTAATCTCTTGCTTAATGTGATGAATAACAAGTGGAGTAC 
ACTCATTCCAGGTGTCAAAATAAGTGCAGGTATCATATCGAAACTCCCATGGACCATGAAAACAATGTACGAGATTGTTTCTTCGCCCAATAATAATAAT 
AACAACGGAGACTACTATTCTACATGC^GGCGAATGGTAATGGAATATCCTATCGGGGGTT^ATTGCACAOSCCTGCCATAACTAATAAGTATCCACGCT 
CCAGAATGCTCACCTGTACAAAGGGCAAAGACCACCAGAAGCTATATGACATCTCTAGACAAATGTTTGATATAATAGAAGCAAATGGACAACTCTGATT 
ATTATTATTACGGTTACGAAAATTCCCCAAAGACTAAAAAATTCAATACTGCACTGATGATTGGGTTCATTTTATTTATTTTATTAATCATCGGAATTTG 
GGGTTTATTTAGATCTCGAAATGCAACCACCCAAGAGAGCAAAACTTCTTCCCCAACAATCTCCTCGACCCCAACTACATATTCTGGCAGCTCAACCAGC 
AGGGGTCCAGGTTCTGGATCTGGAAACAAACCCAAAGATGACACATCCGTTGAAGGAATAGACCCTGGCTTACTGTAACAGAAAAAAGAGTAAAAGGCGA 
CAGCTCGCTTGCCAATTGTCCTGTTACGTACTCTGTGGTTTCACGAGGTTGTCATCACCAAAGGTAACCTTTTTTTTTGTCCTCGCCGACAAAACGACAT 
CTTAATAACCAAGCAACGTTCGATAAAGAAAAAAACTCGTCATGGATCTTTCTTTCACTCTTTCGGTCGTCTCGGCCATCCTCGCCATCACTGCTGTGAT 
TGCTGTATTTATTGTGATTTTTAGGTATCACAACACTGTGACCAAGACCATCGAAACCCACACAGACAATATCGAGACAAACATGGATGAAAACCTCCGC 
ATTCCTGTGACTGCTGAGGTTGGATCAGGCTACTTCAAGATGACTGATGTGTCCTTTGACAGCGACACCTTGGGCAAAATCAAGATCCGCAATGGAAAGT 
CTGATGCACAGATGAAGGAAGAAGATGCGGATCTTGTCATCACTCCCGTGGAGGGCCGAGCACTCGAAGTGM 

AACATTCAAGGTGTG^AACAACACATCAAGAAAGATCAACATCACTGGTATGCAGATGGTGCCAAAGATTAACCCATCAAAGGCCTTTGTCGGTAGCTCC 
AACACCTCCTCCTTCACCCCCGTCTCTATTGATGAGGATGAAGTTGGCACCTTTGTGTGTGGTACCACCTTTGGCGCACCAATTGCAGCTACCGCCGGTG 
GAAATCTTTTCGACATGTACGTGCACGTCACCTACTCTGGCACTGAGACCGAGTAAATAAATCGTGCTTTTTTATATAGATAGGGAATTTTAATATTACA 
ACAATAAGAAAATAAAACAATTGAGGAAAT^TATACCATATTTTATTGACCTACTTAACCTTCTTGCTATACAATGAATGTTTAGGTGACTGGAAAAGTT 
TAGCAATATTATCCTTGAACGGGAAACATGCACCAATTACAGGCGCAATTTCATACGCTCTCGGCCTATTGGTCTTTTCCTGGTCATACATTTTAGATAC 
AATAGACAAAAATGGAATGTTTGTATAGATAGAATTGGCAGACAAATCTGCAGTTCTCTTAATCAAAATGGACAACATGTCTATTAACAAATAAGCCAAC 
CCAAAAGTCATGGCAGTTTCTGAACACAACTCACTGTTAATAAATTCAGGAGCTGTATGAGGATGGTTACTAAAGAACCTCTCATCAGTTCCCCAACATT 
TAAAATTGTAGTACTTTTTACATGGTACAATTAAACCAAAATCAATCATCTTAGGTTGACCAGTTATTCCATCAATTACTATATTGTCACTTTTTATGTC 
CGGATTCACTAATCCTTGTTGAGACAACCGAGTGACAATATTTACAATTTCTACCAAAACAAAGGGCM 

ACGATAGCTATGGCTGAAATTGAATCCGTAATGGGTTTCTTGCATTTAGATTGTAGACCTTCAGGCAGGCGGCCAGTAGCTTGAAGCATCCTAACACCGT 
ACAGAGTATCCCTCATCCTACACAAGCTGCCATGATCTTGCATGAGCTCAGCGGCTGGCTTGGAGGAAGTAGTCGAGGGGTTAGGTTGCTGTACAACATT 
CTCGTCCATTTTACATTTCAGGTCTCCAT^ITAATACATCAATCACAGAAATGCCTGCAAAATCCATCTCGACACAAAAACCTTCAACACACATCTTTCTC 
ACACCTACTGCGCCrTTCATTCTTCCCTCAAGAGCGTGACTAAAAACAGTTTCAAATACAAATTCACAGTACATTTCCTTGTCCGTCATAAACTTGATAA 
CTTTTCCAGTTTCAAGCATGTAATATACCCCGACTTTAACTTCGAGATATAATTCAGATGGTCCAATCATTACGACTTGAGAAGCATATTCTGGTTGGAC 
GTGTGCTGGTACAGGAATATTTCTGCGGTAACTTTCCCATTTCTTACATTGGGCAGTGACCTCCCCCAAGAGTGTAGTTTTCTTCTCCAATATAGATTCA 
CATGCTCCAGGGGAAAGCTTAAAAACATTCTTTGCCATGTGATCGATGCTCATGGTTTTTATCCACCCTTTACCATTATTCTTGCATAACCTGTTCCAGT 
TCAAGAGAGGGCTAGTGTCTAAAACGGATAGGTTTTTACTACCAAACCCAACAGGCCACCAAGATGAAGGGGGTTGTTTAGTACGGTTGTTAGACCACAA 
AAACTTGGACGCTTTTTCCTTTGCGTCAATACAGCTCTGTTTTATGGTCTCGTAGTTGGACGACACATCTTGCACGTCAGAAATGAGGCCTAATTTTGCG 
GCCTGTCTAAITCAAATGGAACGGAGATTCTAGACCGGCTGCAATATTGGTATCAGCCTCTGCCACTGCAAACT 

ACTTGTCAATGGAAGAAAAGTTCCTCTTTACTACATTTGTTTCAGTCACGGTATCACAAGAAGATTTCAAGTGTTTTGGCCCCTTGACATGGGAACGGAG 
GACAGACAGTGGAGTATAGTCTCTAGCCTTCAAATTGACAGTAGAGAATTTTGCACAATTTGGATGAGAGAATCTTTTTCTGTACTGGGTGTCGGCATTG 
GCGCGGTGCTTCCTCAACACAGACGCGGCACTAACAGAAGGCAATTGCTGCTTCTTGATAATGTCTGTCGCATACGTTGGATTTAATGTTGCAAATTGAT 
TTTCATCAATAGTAATAGGCCTCTTCTTCTTATTTCTATTAAAGTTGTAAGGTGTGCTCTCGTAGT^ 

TTTTCCTCCCACCTCTCCATTATTCTTCTTTAGTCTCTTCATGGGCCTAAGAGAATAAGGAGAATCTGAGGGAGGGGATCGGCGAGTAGCGTTTTTAGAG 
CAACAAGATTTCACAGCTATGGCTGAAGGTATCTTGAATTGCCCTCCTCCTT<rrCCTCCTTCTCCTTCTCCTGGCGTTTTCGATTGGTAAAGTCCCATCA 
CGGTTGCTGGCGTAAGTTTTGTCCGTTGGTCCCCACCCTCCATTTTAACGACGTTTTTTATACTCTGCTCGGGCTCTGGGTCAGAGGGGGCTATAGAAAT 
CACTCGTGGATAGTTCCTTTATATATTTAACCAATTTTTCAAACAGGGGTTATTAAAACTACTCTTCTTTTCTGTATATATTAGGTACAAACACACAATA 
CAGGTCAGGCCTGCAAAGGACTGGGTATTTACCAGTCCGAATATAACAGTCAAAAGGCTTATTTTCCTTGATCTAGAGCTGGAAATGAGTGAGTAAAAAC 
TTGCTGAGACATATACTGTCTGGCCGCCCCTTCCTTATTAGCCTGAATAGAATCCTCGAAGCTTACAACTACATTAAATTTCCCCATACCGTTCTGGGTT 
TTCAACATATTTGCAC^TCCAAACGTTAC^GTTTTCATGGATACTACACCCTCGTCTCCTAACAAATCACAGTTCCTACTATTTTTGTCAGCCGCATTCC 
TTTGCCTCTTGACAGATTCTGTTACAGCGTCCTGGATTTTCTTGGCCCACAGAGCTTTAATAGATCTCTCGGACAATTGTTCACCATACCCTAAAACAGC 
ACCTCTGGCAGTAATGGGGCAGTAATTAAAACAGACAACAGAACCTCCTTCTCTCCAAAT^^ 

CTTCTTAGACAGTTTGAAATAATGTCCAATTCTTTACTCTTGTGGGAAAAGGGTCCTCCAATAAATCGACCAAACACACCCGCACCAGAAGAAAGG^ 
CGTTAGAAGATGGTTCATTAAACATCATACAAAATTTGACATCAGAAGGAGGT'TTCCC^ 

CTGTACAGGGTTCACCTTTGAAGCTCTAACTTCTATTACAGCATTTGATATGTCAGTTTCCATGGCCTTCATTAATTTGGGGATATTCCTATTAACAAAA 
TCACACATAGGTATTTTCCATTCAGGGTGGATGTGGGGTGAAAGAACAAGCTCTAGAAGCACTTTAAAGATCTAATCGCC^GTGATTGTTGTGACTGXSAG 
TGTTGATTGGGTATAGAGATTCTTGGTGCTTCTTTTCCCTTTCTGATGCTTCATCACAAACATCCATGCCTTCACAGTCATCGTCATCATCATCATCATC 
ATCTTCACTGTCCGACATTTGGTGGGCACTATTATTATTAGTGTTCTTGACACTGTTACCAATTTTCTTGTACATTTTTTCAATAGTTAAAACATTAGCA
TTATCAGCACTCTCATTCTTCATTATGTTCTTCAGTGAAAGATCCCTCTCCACGTTGTTGTGTTTGGACACGTAAGCCTTGACGATCTTCCTCCATGTCC 
TTATAAAGTGTATGACTGATGCATATTTCTTACCCGTAACAGGTTTATGAGTCAGCGAGGCTTGGTAGAGCTTGAAGGCGTTTGTCCAGTGATTATATAC 
AGTCGTGTACAAGGCCTCGATGGTTTCCTTTGCCAGCTCAGGGGATGTAGTTTTACTCCCATAT^ 

AGAATGTCCTTCAATAAAGGGTCCAACGTACTCTTATTTAAAATTTTGTGTTCGAATTCATATATCAGCGAACAAGTTTTCATGTCTTTGTATGACACAC 

AACATCTCACAAGACAGATGACTGCAGTCCTCAGAAACGTATCACTTTTCCCACTTCTTTCTTTCTTCGTGTCAAGAAGAGGTATAACGTTCTTGTACCA 

GTAATCGAGTATAGATTCTGCATTGGCTGCGATGGTTTCTTTAACAGCTTGGGCrAAATTAGATACAGTTCTGTCCTCTCCTTCAAAAATCACTTTGTCT 

TCTTCTACCTGGCGGCATTGTCTTTTGTTTGTGTTTTTGTTTCCAGAGAGAGTAATTGCAGAATTTTTCTGTTCAACTACCAGAACTCTATAAAGATTTT 

CCACTTTTTGGAAGAGAGATTTCTGGCGGTCATCTTCAACAACTGGAGCCATGTCTGTACCGGAGCCAACTTTGTTCACAACTAACCGCTCGGTGAATGC 

GTCTTTTAAAGCCATTACGGTCCATGCCATATCTCAAAGGTTGCCAATGACCAACATAAATATCGGCACAGCATACATTGAATTGCCTAATACTATAAAG 

GAAGGACTCGATAACCCGAAACCTATATACTTTGATCCTTTATCTAAAAAATATAAACCGGATACTTGCACTGACGTGCTGGACTCGTTGGACGTGTCGC 

TAGAGGACTATAGAAAAAGTATAGGATGGCCTCCTCGGGGGGATTCTTTACAGGAATAGATGACCTTTTCAAGACAGTGATTCAACAAGAAAAACAAGA 

AAAAATAAACCAACTCAAGCACCAGAAACAGAACCAAAACCAGGGCCATCTC^UiGCTC^ 
TCTGTCCTCCTCCACCTAATCCTCTCCCTCCTCCTCCTCCTCCTCCTCCTAAGCCCTCAAGAGAAGAACGGCTARAGACGTCAAAAATACGTTTAAACAA
AGCTCTTAGTGATATTGTTGAAGCCACAAACGAGCGTGTTGATGCGTTGA 

CAGGTTTTAAAGTGCTCGATAACACCTTCTGTACCAACAGCTATTATAGGCGCACACGTGAAACAGGTGGCCAAAAGTAGCGAAATCGAACTGGCCGTGA 

ACGAACTCGATATAAAAAATAAGTGCTCTTTAGTGTACAACGAAAATGAGTCGTTAAAATTTTTCAGGGACCATGAGAACCTTATACTACAAATTGCCGT 

CCAGTTATTCTCTAGGCACGATAAGACCAAATGCGTGGGGGCAGAAATATGTGTTAAAGGCAACGAAAAAA 

AAACTCCCCAATGCACCATCCTCATCTTCAACTGTGCTAGAAATTAGAGGCGCTACCAGAAATTT^ 

CTGTCAATGAAAACAAGCACATTCCTCCTTCAGAACGAGCCAACCTGGACACGAC^ 

CACTAAAAGGAAGCTTTTCTTTAAAGGCAACACTTTTTATCAACGAAAACCAACATTCGATAATAA 

AGTGAAGCATCAAAACAAACCACTAAATCGCTAGACAAGCCAACGGACGACAATTTATT^ 

GTTTGAAATTTAAAAACGTCCTCTATAAAAATAGTACCGCACATCCCGGCAA^^ 

TTCGGCGAAAGAGTACAAGATGGTCTTTGCAGAAATCGACAAGTGTTTGGATGTTCTTTTGGCCATAGGGAAGAATGACAAATACACAAAAAGCACTGTC 
ATACAATATAGAGGAAAGTTTAGAAGGTATTTAATATTCTGCTACGCCTTTTATGCTCTAAA 

TTAATTTC1TTAACCTTTTCTCCTTCATGTATTGTCATGGTCCGTTTCTCCATTCCGCCAGTTTTTTGTCCACATTGACGTTCGTCTATCAACACATGTT 
TTTCCCCATGGGCACAGCCGCCCCATCCGTCTCAGCCAAGCGGCTCATGGATATCGATTCCGCCCTAATGAAAGGAGGAAAGGGGGTGGGTGTGAGGGAT 
TTTGGTTCACCTTCAAAAACAAGTCTCCATACAAGAACATTGGTGTCTTTCTTAGGTrrTGCTGAAATGGCTATGGGAACAATGACGGCTCTCTTATCTG 
GTGTAGAAGTGCGTGTATCTCCAGCTCTCCAACAAAGGATATCTAAATCCCTAGAAAGATGGTGTGATTCAGTCATCTTTATATATTTCACCTTTGTTTT 
ATTCCACAGATTCAGTGGTGCGAAAAAAGTATCACTCGAATCGGCGCTTCGCCTCATCATGGGGCAGACGCACGCCCACACAAATAAGGTGAGGGCCGCC 
AAGAGATGCCGAATAGAAGCAGCGGAAATGGAAGGTGTGGAAGAAGAAGAGGCGGGCCTGACACTCTCTTATGCCCATCTATTGGGTCTTCCTTACTCTA 
TACAAAAAGCCCTCGGATTACCTGTCCCTAAGATAAACCCTCTCATGACAGCATCTTCTTCTCAATACAArrTAGGGGATTTTGTAGGCGTGGAACAACT 
TCTAAAGGCTAAGAGAGAGTTTCCAGCCGAAGGAGAAACCGCAGGATTTCTCGGCATGTTTGATAATCTAGTGAAAGATTCTATTGACAAATACTACGGC 
GAAGGAGCCTTTTCAGACGTGGTTGAAAATGTAAAACAAGGCATGGAACAAAACACACCGTATGACACATCTTCAGCGTTGATGACACCTATCCCTAAAG 
CATTCTACGAAGAAGAAAAGGATGTTCCACAGCAGGAAGAAAATTCTACACAACAAAGATATAGATTGAATAGAGACGTGGAGGAATATTTAATGGCTTC 
TCCTATGAAGATGGTGTTTGTGTCTATACTCGATAAAACTAACCAAAAAGAACGTTTCATGTCTGTTGGGGATATTGCCCTTCTGGCCGTGTGGTGCAAA 
AGGAACGTACTGAAAAAGGATTGGAACGAATACGCTATCGCTAAAGGCAACTACGAATGGCTTGGTGCTAAAATGTGCAACCATTTACTTTTAGCTGATT 
TAGTGAATTTTGGAATATTAGGTGACTTGAAAATAACCAATAAACTTGACACAAATACCGACACCTTTCACAGAGACAGTGATAGATTACCCTCAGTTGC 
AGATCAGAAAAAATTTATAAAAAACACATCCCTATCTGATCGAAAACAATTGGCCCTTGTTCACTCGTGCGTTAACGTGAGCACCCGAACCCACGTAGGA 
AGAGTGACTGCAACATCATGGGCTGTCGATGCGCTTCGTACCTATACAAGAGGTGATAAAGACATGTTTGCCGCCCTATCTOCATCGCTGGArATGTACC 
ATCTTGGGCACACGAATTCAGCTAATTTTGTTCCATATTTTAGTAGAAATTACCTATGTAACGAACAAGAGAATGGATTGTGGGGGTATACTCGCAGAAC 
CTCTGAAAAATTGGCCAAAGAAGAATTGGGAAGAGGACGTTTAGGGGGCCTGAATAAGGTAGGGGTGGCTAAAACAGAACTGGCTGCTGCAGCCATTGCA 
ATTTCTTCTGCCTTAGATATGGGGGAAGTAGAAGCTGTAATGGACGACTCTTCTAAAGTGAGAAAAATAGCCTCCACCTGCTTAAATGTTAATGCAGCCA 
AGGTCTCGGCCGCCAGAGAAAAGGCGAGAGAAGCTAGTATTAAACGTCTTCTTCTGGCCACTAATGCACCAGCAGCTGGTTCATCCAGAAACAGTAACAG 
GTTTCTCCTCAAAGATTTGTGGGGGTTCTTTTCTGACCCAG 

TTTCTTCATGCTGCTGTTCCTGATTTTGTTATTGAGTATTCCTTCGAAAGTGAAACCTCTATAGTGAGATTACGTTTGAGACTGATTAAACCTGAAAAAC 

AAGACGAAATGGTATGCCCTTCAACAGCTCCCGAAGCTAATAAGAAGAGGAAATTAGTAAGGAATAATCAAGACGCTGTACTGACGTTGGATGATGAAGA 

TAACATCGTTAAATACAACAAATATGATATGCTTGAAGACGAGGAAGCGCGTGAAAGATTACGCCACCAGGACAAACAATCGGTTATTGCAGCCCGTATC 

AGTAAAGTGTGTGAGCGGAAAAATCCAAAGAAAAAACGTCGTTTAGAAGACCCTGAATTGCAAAGTGTGGATGAACAATTGATACGGGAACTGG 

TTGCCTACTGACGAGATCTAAGATTTTCTAATATATTATGTGT^ 

GAGACGATACTTTTAACCAAGAAACTGCGGTGAAACTTGTACGATGGTATACAGAGTACGATTGTTGTTGCCCATTGGTTAACCGCGTGGAGCGCCTTCT 

AGGATCGTTCGGAGGAGGCGTGGACGCCACGTCTGTGCGAAGCCGACCGGCTCTTTATGAAGAAGATAAGAAGGGAGATAAATGCATACCCTTTAGGATA 

ACGTCCCTTATTGAGGGTATACTTTTGGAAAGGGCTCTAACTAAACCCGATTTAGCTGCTGCAGCTTTTGATGTATCAGAAAAGCTGGTGTATTGTAGTT 

GTAATAACACTCAAGGCAATTTTGATGTCTCTTCAATGACCATATGGATTGATGGCAATAATAGTAAAAAGTATGAAGTTACAT 

CGAGAAAATTAGTGGAGGTGCCGAATCTATTCACAAGAAACCCATGTCTCTTC^ 

ATTGAACTCAAGAAATTGTACCTCTCCTTACTAACGGGCTCGGCAGCCGGAGGAGGAGGTATGTACAAGGACAGCTCCCAACAATCTTCCTTCAACGGCT 
CTTGGACGTCGTTGTTGTTCCACACATCTAAAAAGGACAAGACTCGTTTAGAGGCTGAAGTTTTAGTCAGTAACAAGATAAAACACACATCAAGATTGCA 
GCCTAGGTGCGTCTGTTCCGATCTGTTATATGCCCTATGCTCCACCACTAACAACTCTGCATCTTACGCGTACAAGGCAAGAAATTTGTGTGTTATTGAA 
GGTGGGGAATTTTTATATTTTAAATACACAATCTTTGAAGAGAATGGACCTTT^ 

CTGAGACAAATTCATCGGCACTGGCCGCTTCTTCTTCCTCTTTAGAAGACGACGATGATTGTTGTGATGATGATGACGATGATGATGATGATGAAGACGA 

AAAAACTAAGAAGAAACAACCCAAGAAACAAACCAAGAAACAAAAAACAACAACATCAACACTTCCACCTATCAGCAAAACCAATCACGA 

ATGAATGTACTTAAAAAAGGAGCTGTTAATGGAAAACGGAAAATGATGGATTCTTTGTCAGGAAAA 

CTGCTGCTGGTGGTGGTGCTTCATCCGACGTTGTTGCAGGAGAAAATGAGGAAGAGAACAACCCCTCTTCAGTGAGTCCTACTAACAATAGGGATAGAAA 

AGACTATGTGCTTCCATGCCCTCAAATAGAAGAAGTCACTATTTTTTCAC^ACACAGGATGAACAATAACAAGTTGGCAGAAAGTGTAGTCAAACATTCT 

GTTGTTATTAATGGAAATTGTTTAAACTTGTTTGTTACTCAACACAGAAAAAAGTATATCCTGCCTCACGAAAATATTCTTTTTTGCCCACCTTTAGTCC 

AGCATGTAGGATTTAACAAATTTCGCATTTTGACTGGCGTTTCTTGCTTTTTTGATAGAATTGAAATTGTTTTTTCCGACCAATCTGACTCTGTGGTGTT 

GAGTAATAATGCTGCCCATTCGGCTATTCTAAGGCTATTGTCATACATAAGAGAAAACTCATTGAAGCGAAGTGTGAGGACTGCTTCGGTGAAAGGAATC

GATTTTGTCGTGAAATCACAGGACACTAATATAGGCATACCTTTAAGCAACAAGGAAATAAGAGAACGGCAATTATGCTCAGCTTCAACCCTGAGTATGC 

TAGCTGGTTTGGGAAAATGATCACCGACCCGGGGGTCATTTTACCCGTTTCCAAGGATGTTGTACTCTTTGGATCACGAGGACAATCAGATGTGGGAATA 

ATGACACTGGACCCTCACGATTTGGATATAAAAATCACATCTAAACGCATAGGTGTGGAAGAAAGACTAGCTCAATACAATACTCTACCTATGGATTTTA 

CACGGGCAATGGAAAAGGAACTAAATAATAGTAGAAATATGAAAGAGTCAATATTCACGGGAATATTTTTAGACACCGGTTCGGCAATCTTCGAAGACAA

CATGTTCAACGGAGGAGGTTCAGCTTTGCGCTTAATTAGATCCCCCGCTTTGAATTCTGCTGTATTTTCAAGCJ^GAACTACATCATCAAACAATTGCCC 

ACCATAACCAAATCTCTAAGGAGAAGTCAAGCTAGAGATAAGCAAGTGGATAAAACAAGAGAGAAGATAGTGGTGGATTCTTTCAGCATACTTAGTGCTA 

TAGCTGCTCAAGTAATGCACCTCACAGACGGAGAGATGACGTACGTCCCCGATGGGCACTGCGTTAATGTTGTCATGTCAGAGACCAATGCTTCGTCCAT 

CTACTTGATCATAAACGACCCCACTGGTTCGGGATGGAAAATTATGCCTAACAATTTCAATAAAACACTTGAAATGAGAGACGGTGTAATAGATAGAGTA 

GAAACATTAGTGGAGTTTGCGTGCAAGTGCGTCGCATCATCCTTGATTAAAAGGGGCATGGATTTAGTGGATATGCAAAGAACTATAAGGTCTATGGATT 

TCCTCCCTCCAGCTTCTTCTACTTCCAATAATACTCCTAGAGTAGCGATAATGACATCTGGAAGTAGTACTACTACGGGCATTGGATCCTTGTCCATTCT 

TGCAGAAGATGGATCAACACACCACCAAATCAAGCTATCGGAATATAGGACTGGATTATCCATTACTGAAAATAATAGAGAGGTGTCTTTTACGGTAGA 

CCTTCAATAGACGGCGTTCAAGCAGAGCATCCTCTATCCCCTTCTATTCTTCAGTGGTTACCTCCTCTAGTTAAAAGGCCAGAAGTGGTAGCAGCAGCAG 

CAGCAGCAGTAGTAGAAGAAGAAAACGGGGAO^TAAACCTTCTGATAAAGATAACGAAGACAAGTACAGTGATACTGATTTTTGGTCTAATGTCCCCGT 

CACACCTCTAATTACACCCAAGAAATGGAGAGCGTGCAAAATAAACGATCGGGCAATGATTAGTAGTTGGAAA 

GATTGGACAAATAAAACTACAAAGGTTGATTATTTTGATAAGATGGCTGCCTTTGTTGCCCTCATGACCTTTAGAAAATTCCAAGACATACTAGCGGATA 

ACTATGTTCCTCCTCAAACCCCTTCTCAGGGAAGTGAATACGCAGTGACCATGTCTAACGTGGCTACACTCTTTACTGACGTGTACGGTTT^ 

TGGAAATAAGCCATTGTTTGCCCTAGAACAGCTAGAAAATGAAACCGGGATTGAAAGCATATACGTCCTAAATATCATAGGAAATTCCCCTGATGGTAAT 

TCTGTCAGGGTCGTCAGACTGGAAAAGGAAATGAGTTTCCTCTTGAAGGCGAAGCAGTACTTTACAGAAATGGCCATACCTCCTATTAATGAAAAATGCA 

AATGGACAGATAAGGCCCCGTCATCTGTAAAGGAGTACAAGTATTTTTGTGATCTAACAGCACCCATTTCAAAGAGACCTAGAAAAGATAACAACGACGG 

CGGTGTGGAGCATTCTGCGTTGACTTATACACCTAGGTGCATATACCACACTGAACGTTGTTTAGTCCATCTTTACTCTGAGCCAGAAAAAATAACAGAA

CACGTATCTTTCAACAAGGATTTGAACATATTAGAAATTGGAAAAAATATTACCAACCAATACCAAACAAACTACAAAAGCATATTCGAAATTG 

TTCCCATAATTGTCGCCTCTATGTCATCAACAAAAACAATGACTGTAAACAACTACATAATTTCAACACCTTCTGCCACGACCAAGTTTGTTCAGGATCC 

GCCAAAAACAGGGAAACAACTTCTGGCAGTTGAAGAGGTCAGAAACTTTAAACTCAAATCTGTTCTTGTTCCTCCTCCTTATTTTAGGGACAATAAGCGC 

AACACAACTCTTTGTTCACAAATAACTGAACAAAATTGCCCGTCATCTTCTGAAGGTGGGCGTTTTTCATGTCCATCAGAGTCACTTATTCTCAAGTACT 
CTAATCTCTCTAAAAAGCGCGCACTGGAAGAAATTGCCCCAGAGACTGAGACTAGCATTTTGTCACTAGCCATGTAACGTGTCTCGATATATGACAATTT
AAACAACCTCTTAATAAAGAGGATAATTATCATAATAGAAAATTAGGTAAAATGGGAGTACAGAAGAATATCCT 

TGTTATTGGGTGTTGTTACACTTCTAGGAACAGTAACAGAAGGAGCACCGGCAGTCCCACCTTTTTCATCGTCTTCCTATTCCTTTACTCCTGAATCGAG 

TGTTTTCTGGGTCGAAGGAAATCGTGTTTTAAGTGGAACCAAGAAGGACACCTTAATCAACGTTCTGGGGAAGAAGATTCCTTATTATGCTAATTCCATA 

TTCAGACATGACTGTTCTGAAACTCGTTCTATTCAATGGCCAGAAACTTCCCCTTTGGGCTTGAACCTTATTTTCTGTTCATGTGCGAGTCATGAACATC 

AACACCGTACTCATGAAACAACAGAACCTGACGATTTATTGTGGGACGGATCAAGAAAAACTACCACCATA^ 

TGTGTGGACATCTTTATGGAGGGATAACGACCAGAAGTGTGGGTGCGGGCAGGCATTTGTTTCTTCCTTTACT^ 

TGGCTTGCTGCTCACACTAACGGAAAGACATCTGAGGGTGATACTAATTCAGCCTACCTTTTCATAAGCCTTCAACGAACTACACTCAAGCCTATCATCA

CTGACGTCACAGAAGATAATATGATGATGGGAAGAATGTCAGGCACACCCATGAACCCTAAGGATATGACATATTTTGTCAACGATTTTTCAGACGATAT 

AGGAAGTACTCCTCAGTGTCTTGTATCGAATTCGGACATCCTGAACAAGAGGGAAGAATGGATAGCTGTTTGGGGTGTTGCAGACTCTAAAGACCTCCTA

ACTAAACATCAACTGGGGGAGCGGGAATATGGGAGCGAAGGAAGAAGAAGAAATCCCGGTGTTGAGGAGGAAGAAGAAGAGAGAGTGGA^ 

AAGTAGATVGTTGCGCTACCTTACATTAAGAAAAGTGGAAAACTTATCGGACCTCGTAGAAGACCTTTG^ 

TACTAATCCTATTGTTAGAGAGGTTGTGGAAGATTTTGATTACGAGTCTTTTAATGAACCAGAAATCTTTGGC^ 

TTTCTTGATCAAAAAAATTGGAGACTTGGTATCATGAGCAGAGTTTCTTCTTCCATCGCCA^ 

GTTTGGC AGT CTGGGTT GGGGATGAAC AT ACCCC TAAATTC AGACTT AGTGT ATGG AAG AACTGG AAGC CTTTT AC T TCTGC AC C T ATT ATTGTCCAGAA 

TGTAGGTTATTCCTCTGATGTTTTCTGGCATGAAACTCTTAGAAGCAAAATTGTTG 

GGGGAAGATTGGGCTAATAAAAAACAAACTGTAGTTGCTATGTTTATTTC^GGTATTGTTTGTATA^ 

TATATTACAAAATAAAAATGCCTAAATTCTAATACAATGGTTTATTCATTCCTTATTCCTACTACTAGTGTATAGCACATGAAACAAAGAAAGAGACACG 
CTTCCCTTGTTCCTTAAACAATATGTTTGATACATTTTTCACAATAAATTTGTCAACTTTATCCATTACTTCATTACCTTCTCCTGATATCTTTTCCAAA 
TCCACCACAGTTCCAACACCAGTAAACATGCTCTTTCTGATATCCAATGTTGAATACTTGGCAAATTCATGCCCTTGCATTTCAGTCATGGCAATGATTA 
TAGCACCATACATTTTACATTTTTTGAATAGAGGATATGTGGCAAAGTTG^aGAGAGAATATCTTTGTGGGAAGACCTTCTTCTTCCTCAAAAGTACTTTT 
CAGTGCTCCTCCAGTTCTCTTTACATGCAAAGAAAATTGTACCCTTTTCATTG<;ATGAACTTGTCCACAGAACAGGACATCTTCAGAC^TTGAAAAGTGG 
AGATTGTTAGTAGGTGAAATTGTCTTTCTTGGTGCCAAAAGACTAATCGTATCAGATGGTGTTTCTGGGTTACTTTCAAGAATACTTTTGTCCTTAATGG 
GG AA T TC ACT AAC AC CAGAAGC TGC TGCT ATTTTTGACATGAAC ATTAACAT AC AAT AT ACATTAGC TT CGTCTG AC AT AC ACAGCC ATCCCTTTTCTG C 
ATCTACATGGATATTTTCTCCCTTCTCTTGAGCTGTGCATGTGGACGTTACAATAGA.CCCATGTTTCGATAGACCACAAGTTTCCGTAGGACGACAAAAT 
TTACCAGTCCTACGCAAGTTGTGCTGCATACTCTTACATACAACATCCATGTATTTATTA^ 

TACCTTCAATGT^ATTTGAAAACTCATCCGCG^TTGTTATAGGAAGACGCAATTTGACATCTCTTTCTCCCCCGTCCTTGCAATGGGCAAAATCTTTAGA 

CTCGTAACCCAGATTTGGTACATGAGTAGAATTGGTGGTAATTGATTTTGGGCGTACATACTGATTCAGGCACGGATTGGGAGATGGCAAAAAGGCGAAT 

GAAGAATTCAAAACGTTCAATTTTTCATACACTAAAAATATATAATGGACATAAAATTGGAACACAATTGTAGCCTTGTTATGGGGTGCMTATC 

ACAACATTTCAATAGTGGACTCCTTTGGTGCCCTATTCACAAAAACTGGGTGTATTTTGATATAGTCTTTCTTGTTCTTTTGTTTCGTCTCATATTCTTC 

ACACAACTTTGAAGAGGGCTGTTTTACAATACCACTGGCTATAGAGTAGTGTGTGATTGAACGAACGATAGC^GTGTATAGCTCAATCATTTTCTTGTTA 

CATAATTTCTCATAATGTTTAGCCACATCTCCATTATTAACACCAAAGAAAAAGTCTCTAATGTTTTTAAGTTCTGTTTCTAGCCCATGTTGCTGAATAA 

CCTTAGCAGTGTTGGCCATCCAAGACGCATTAAACGATAAAGATTTTCCCGATCTACTTAGGATGAATTCTTCAGCGGCAGCTACCACATCTTTAGAAGG 

GTCATAGGCAGAATACATTGTTACTCCTCCATCrCCATTATAGTCATCATCCATTGCAGCGGAAGATTCTGCCACCTCTCTAGAGACACTAACTGTATCG 

TGGTTAATAAAGTCGCTCCCATTAAAATCAGCAATGCCTCCAATAGGCATAGTATAAGCCTTGTCTTTACTCATGGCGCATTTAATGTTACACAAGTTAC 

CCCGTTCAACAGAATGATTTTGAATAATGAACAAGTTTTGAGGATTACTATTGCGGSTTACAGCAACATACGCATCCTGGGTAGATGCACCAGAAAGGTC 

AACAATAGTGTCTCTAAAGAAAGTGTGCCCTTGGCAAGAATATATCGTCATGGCTTGACTAGATTCAAATGGAAGATATTGAACGTATACATTGTTCCCA 

AATCCTCCATTCTTGATATTTGGATTACCGATAATTTGTCGTCCTTCTTTAATTTCTAGGCATTTTTGTCCTAGCCGTTCCACGTACACAAATACTGTCA 

ATTCACCATTCTTGTACATTAAATTAGTAACCACTCCAGTGTCCTTAGTCACGAACCTTTCCTGAGTGCCATGGATCATTCTATTAGATGTAGTGAAAAT 

CACATTTTGGCCCTGATAAAGAGTAATAGATTTAGCCCTAGTTACCTTCTTTTTCTCATCCAAAAGAATCGTAGTTAATGCCCTAGGAGATTCGGCTGCC 

CTCAAAATTTTCCTrGTCTGTAAACTTGTCAAAAAGTGGTGTACTGCGAGTTTTAA 

CGTGGCTAATGTCAGACAGATCCATATTCAAAACTACAAATTTTACTTCCACTCTTAATGCGGGCACAGAATTTTCAACTGCCGCAGCCGCTGCAAGTGA 
AGTAGAAGAGX^GGATGAGGAATTGTTATTGTTCTTTTTAGTTTCTTTGATGATAGAAGTATCCACTAAATTCGAGACTGCATTCTTTAAAACGTGTTTT 
TTAGTGAGTGTGATTGCACCCATCATCTTGTCAGTTTTGGAAGAAATTGCATCCTrTTCTTGTTTGGCTTTAGAGTTGGCCATGAGGCGTTCACACATAT
CCTTAGGCTGCCCTTGTTCTTTTGTTATAAACTCTTCAATCTTCTTCCTCCTCTCATTTCCCTTTGAAATACTAGTCACCATTTGCATGGCCAACACTAT 
AGATTTTTCAGAGTATTTAAGTGAGTTATTGTGCCTATTGTTAATGTTGTTATTTTCAGTGTGAGCAGTTTGAAGCACTCTGGAAATCTTTAAACTACCC 
TTAACACCCTCCATCATTTGACCATTATTGAAAACATCTTCCTTTCTGTAC^TAAGAAATT^^ 

CTGGGTC AAAGTTGTCC ATG AAAAC AC TC C CATTT CTGG ATGATCT ACCGGCC AT T CGC GTATC T AC C AAGTCTTGAAGGTT GGC AGTCAT ATC AAAC AA 
TGAAATACCGTACATTAACGAATAAGCGTGCTTAATAATGTCTATTCTCTTTAACGACGACATGATATAT^ 

TATGCAGGTAATAACATACTCTTCCCTACATCTCCTCCTGTTGATATGGGCATAGATAATACCCTCAACCAATATTCGTCATGGAGAGGCTTATCTGCCC 
CTGCTCCTCCTCCTCCTCCTACTCCTACTCCTCTGCTTGTGGCCCATTC^GAGGGC 

GAGTTTTCCATTATCAAATTTAACCTGTTGCATTCTCACTAGTGTATACAAATACCACACACGGGCTCGTACCAAATCTGAAGGCTTATTTTCAGTGAGC 

CATTTTTTCCACACAACATGAAAGTCAACTAAATGACCTACACGATCTAGAGTAGATGCGTCACGGTGCAATTTATCTGGTATATCAGTTCTATATATCA 

TTTCTCCACTTTCTTCATCATCGTCATCACCTCCCCCACCACTACTTTTCATACCAGAACCTCCACTTCCTCCGTCTTCCAAATCCATTGCTTCTTCTTC 

ATCGAAATAACAATAGTCCTCGTCGTCCTCATAATAAGCTGCTGCTGC^GCTGTACGTGATGATGCACTATTATTATTACCAGTTTCTTTCTTCAAGAGA 

ATAGATCTTATATTTTCAGATTTCGCTAGTAGTTCTTTGTATGTTAAAGTTTCATTCTCTAAAAATGAAGCAACTGTACGCAAATTATCTT^ 

T AGC G AT AGG AGTTTC ATC C AC AAAGTCTTTTCC ATCTCT AAAG AAGGCGTTCCTCA C AC ACGCTGC C AC AGCAG AC GC ACGGTT ATGTGCAGAATT GAA 

TGGTACTGCATCTATACACTTGGGATCAGAACCGTGCACATACTTGACGTTCAATATCTCTTCATCCATTAAGGAGGTCAGATGTGTATCCTTAAATCTC 

TCTGATAGGCTAGAAACAATATACAATTTTTCTCGGCCACTGTAATCAATTTCTCTCATGGCTTTGGCCATCATGTCGAACACAATACTGCATACATTTT 

CAGTGTACTTTTCTAGATTTGCATTGCTCATTTTCTTGAGTTCAGATATGTGAGCTAGCTCTGA^ 

AGATTTCATCATATCCAAAGAAGACTTCTTGTCTGATACAGAAGTTGTTCCACrTCTTATCTTTTCGTTTGCATCAGAAATGCTCTTGCAGCGTAAAGAG 

AATCCATTCAAAGCTCCCGATGCTCTGTCCTTTTCAAGTGCATTTCTTTCCATCAGTTTCAAAAATGCTTCATTGTTAAAGAAATCGTCATCTTCTTCGT 

CATCAAATTCGTCACAATTTTCTTCTGCCGTGGATTGGGCCGGGGAAGAACTTGGTGCCGTCACTTCCTTTTTATTAGAAGCATTACCACAGTCTTCAAC 

TTTAATAGCACTTGAGACTAAATTGTTGTTAACTGACTTGACGTTATTACTCTTCATTAAAATTGCTTCATGGCGTTTATTGACAATATTCCGCJU3GTTA 

CTTGGGTTGCCAAAAAAGTCTCTAGAGCAAAACATTTTCAGTATCTTTATAAAAACTTCAGTGTATAGTTTTGCATTATCTTCTTGGCCAAAATTATCTT 

GTTCGACTGACCCTTGGAAGAAGGTACTCTTTTCTTCCTCTTTCTTCGTCTCAGCTTCATCGTCACTTTCTTCTTCTTCTTCCTCCTCCTCCCCCTCTTC 

TTCTTCGTTTTCGTCTCCCTCTTCTTCTTCACCACCTTTCTCTTTCTTCTCTTTCTTCTTCTTCTTGTTCTTGTTGTTCTTGTTGTTCTTCTGAGGTTTT 

TC T T C C T C AC TCGGGTTC T T TATT T TC TTC TGGAT ATGGTT AGTGTT ATCTTCC ACT AC TGC AG AATTT TGCCTGTT ATC TC C AAGT AGT AT AAC AAAC A 

C TC C TC C G AAAGGT AT A TC TGGCC T GTGTTTCGC TAT AC GT AAAAC AGCATC AAT AAAC ACC AATTC CC TGC AACTGGCC AT TGT AT ACTCATCGAT AAC 

AAGGAACCTTAAATTTGCCAACACTTGTATC^CTTTCCATTTTAGTTTATCGGCCACAAATTGGATTTTTTCTAGAGTTATTTCATCTTTCCTAAATATA 

ACTGGAATCTCAAATCCTTTGTTGAGGGTAGAAAACACACTTTCTGGGTAC^TCAAGGAAGAACCTCCTGGTTTCTTGCATGCACACATGAGACTCTCCT 

TTGCAGAATTAGATTTGCAACACAGGAGCATGTCTGCAGATCCAGAATCAATCAAGCCGTGGTCGTTAAGAACTGCAATTAGTTCCTTGACCATTGTAGT 

TTTCCCAGCTCCAGGAAGACCACCCAGATGGAACACAGATATTCCTCCTCCAGCATAACTGACAGAAATCGAGTTGCACAGATACCTCACTGCCAGAGAC 

TGTTGCAAGGAAACAGTGCTCAATACCTTTCTGGCCTCCTTGCACGTATCTGGGAGCTTGACCGCAGAAGCAGAGGAGGAGGAGGAAGAAGAAGACC 

AAGGGGATGCCGCTGCAGCTGCAACTGCACTTCTGAGCACCTGGGTTCTGKn'AGCCAGAAAGGGGTATGATTTGATGTTGATAACTGGAACTGGATTTGA 

ATTATTGTTGTTGTTCTTCTTTTGTTTTTTGGTAACTGGCTCTTGGCCGCTGGAGCACGGACGTTTACTCCCCATGATGCACTCTCGGTGAATGTCGGAC 

ACGGGGGTAAAACCAATCCTTTTATACTAGTTCTACCTCCACGAGGTTGCCAGATGGTACGATACTTTATTACCAAAAATGGATCGTCCTCTTTTGC 

AGATTTTGTGTGTAACCCTTATCGTTACGATTGTTACAATATGTCTTATGGCATTTTTTATACATAGCAATTCCTCATCTTCTTCAGGCGCCACTGCCGC 
CGCTACCGTTGCAGCGGCAGGAGGGATATCTTCATCTGCTTTATTTGGTACCTACATAGGGTCCTTGGTGCAGTCAGAATTTTTTTCAGAAGATATCGAT

TTACCAGAAAAACAAAAGAAAAAGGCTGATAAGGAAAAGGCATCAAAAACCGATAACAATAACAACAACAACAAGCG^^ 

AGAGCCCGCG<:GCCAGAAACAGGACTTTAAGAAGGAAGAGGAAATTATCCAAAAGAAAAATACACGTCAAG^ 

TCTCAGAGGAGGAAGAGGATTAAAGCGAGCATCTAGAGGCAAAGGACTGCTTAGAGGGATTGGAAAAAGGACATTTGGATTGTTTTTCCTCCCCTTTTAG 
AAAGGATGTTTCTTTTAATTATCATGTTGATTATATTAGCCCCATTAATATTTGrTGGGGTGAATTTCTTCGTATACATAAAGAAGAATAACTGGTTCTC 
TGGGGCCATTAGTAATGACACTACTATTAACTCCACAAATTCTGGAGGAGTAGGAGGATTATGGGATAATGTTTCTGATTCAGAGTCAGGGACGTTTCCA 
TCTAGGGTACAATATTCCACATCAACGAAAAATTTAGTCAATATAGCCATGGATGGAGATTATGCGAC^^ 

GGCCATATGAAACCTACAAAGACGTGGAGGATAGCCAGTTCTATTTACATTTTTTCCACGTTAGAAATTTTAAACCTTTAAACGGTGATGAAAATAAAGA 
CCACCTTGAAAGGGATGAAAGTTTTGTGTTGATCGAATCACCATATTATAATGGAGGGTTTTTATCATACAATATCAACAACCCAAATCCCATTTACAAT 
TCTACTGAAAAGCCGTATATTAACACGGAGATAACTTCCATCGTCAGCACCACTGGTACAGATGAAAGGTTCTTCTGCCTCGAGAAGGAATACGTCGAAG 
ATGGTGAAGAAGGAGTTACAGAAAATAGGTACTTTTTACGCCACATGGCAAGTAATTATGTTGTAAAGGCTAGTTTCAAATCAGTCATGCCCACTATTGA 
AATTAGTGATCTGACTGGCACATACAATAATAAGAATAGTGTTAAAATTAAGCCTGTAACTGGT^ 

TAAAATACATTTTAAAGTATTATGTTTATTGTTTTATTTCTTCTAAAATTTTATTTTAGGTATCCTCTTGAATATGGAACGACGGGTGGGGGGTTTTTTC 

TTACCCTTCTTTGGTTTTTGGTTAGTTTTCTTGGCCCCACCATCTGGTGTTTTTGGTCCTCCTTTCCCCTTTCCTTTTCCAAAGGCTTTCTTGAATGCAC 

CGCCCACTCCTGGTATTGATCCCAGCGCAGAAGCGATCGTGCCGGCCAGTGCTGrGCCCGCCAGGAGAGATCCTAAGAGAGGAGCCTTGATTATTATTAC 

AGCAATAACAATGACAATTATTGTTATTAAAAGTAAACCAAAAAGTATATTTTTrCCTCCTCCTAAAGACGCCACCAATGGCGAAGCCATCTTGGC 

TTATTGTTATTATTGrTATTGTTATTATCCCCTTCCATGGTTGTTGCACATAATAATAGTGTGTATTATAACATTAAAAGAAACATATAAATTTTGGCTC 

AAGGCACCTTCAAGGCTGTGAAAATATTACATGCAGATAATGTTGATTCTGAATTCAGGGTTGTGATTTTTGGTCGCACGTTTACTGACGTCCAACATAG 

GCATATGGATAGTACAAATTGTCTTGAATTCCTCATCTTCTTTGTCCTCTGTACCATCCTGGGGTTTTGCTGGGAACAATGTCCTATATGTTCCCTTTTT 

TATATAGTAATTATTCGGGTTTGAATACAAGTAGTTATGGAAATGGAGATGCGAACCACTAAAAGCTTGTATATTrATCAAAAAATTCAAGTCAACTTTC 

TCGTCATCCACTCTTCTCATTTTTAGGGTAATATCTCTCCCGTTAATGACGTAGATAATTGTTGCATTCAAAGCACTGATGTGCTTTTCATGATATTCGT 

TTAAAACAATTTTATAATGATTTTTAAAGGGATAACTATAAAGAGACCCTTCTTCTAATACGATATTTACTGATGGTCCTGCTACTACCTTTAAAGTCAT 

GATTGTGAACCCTCTCAAAAGAGCTATCCATTTTTTTTCATCCGTAAAAGTATCGTCAACCCTCCGCTTCCGCTTACCCCTCCTCCACTCCCTCCTCCTC 

CTCCTCCTCCCCCAGTCTTCTGCCTTTTATTATCGCTTCCACCTAAAGATCTTCTTCCACCATTATTCGAAAATACAAAAAAACCTACTGCAACAAGAGC 

AGCTATTATCAGAAAAATCACCAGGATCJUVAACAGGGGCATTCGTATTAGAC^TTCTTGCTAC^ 

AATTTGGTTACAAATAGTTAAAAATATGTAACCATGCAGTACAATAAAAGGTGATCAACTGGTTTTTCATTTC 

GCGTATTCTTTGTAACTATATCGGAATATACTTCATCCCATACAGCAGATGAACCTCTTTTTTCAGGCTTAATTGCGATATCCTTTTCAGATTCCATTTC 

GTGGAAGATCTTTACAAAGCAAGATTTATCCGTTTGATTATTGAACAAAAAATCGAGCCGTTTTTTCCCATTArCTACTTCAAAATTCTCAATTGTTAGA 

GAAAAATCCGTCTTCCAGACTACAATATCTTGGGCCTTAGCCAGTGTTCTGATTAGCCCGTGATTTATTTTTCCTCCCATAGAAGGAGATAAATTGTAGC 

TGGTTTCATACCTCCCTTGGTCATCACATTTTACTCTAAAAATTGCCACTGTAAAATCACTCATGTGAAGATTrTCAATCAATTGAGGAATTCTCTTCCC 

TATATCAATTTCAACGCTAGCTGGCGTTTTTAGTACGTCCCATGCAAACG<^TTGAGGCTCTTGTTCGGTACAAAAAAAGTTAATGCCGATTTCGGATCA 

GGGAAAACATCCGCAGCCACCTCAGTTACCCAGGCACTAATGTTGTCTTCAGACATTGGTAAAGTTGCTGCTGCTTTTTCATTTATTAGTGCTCTAATTT 

TCCCTCTGTTGCCCTGCATTTTATTATTGAATACAGTCTCTCCGTAGGGTTTAATATGTCCTCCATACAAATCTACCCCAGCCATGGAAGCAGTTGCAGT 

ATCTGACGACCTCGTGTCCAAAACATTTGTCACCCTAGAGGAAATTTCCCCATCTTTAAATAGTTCTGCAAGTCTCGCCTCCATTATTATTATTTTTAAT 

GTGTACAATAAAAACAATTAATTTGAAACAAAAGGrAATAACAAAATAAAATACATAAGAAAGATTCAAAAAGTTTTATTCATTATCCTTATCACACACA 

TATTTTTTGACGGACAAGTACATTAACATTATCAACATTCAACAGACCACTACCATCATTCATTACAACAACAGAAAATGGGGAAGTTTTGTTGCACAAG 

AACTTCCATACATCATTCTTCTTTACATGAATCAAGAAAGTATTCCAATCATCCCATTGCTCATCGAGCCTGTACCCCAACCATGGGTTTACAAATAGAC 

TATTAAATTTATCCCATGAACAGTATTCCTGSAGAGAAATGAAGGGGTCTTTCCCGTCCAGGTCATTTGGTCCAACATTTTCCACTACCTTTACTTCCCC 

GAGTCCAATTAGATTGACTGTCGTCTTTTCTTTTGTCTCGATCGCACACTCGCCAATGGCAGCCAACAAGGAATCGGGAAGATCACTCCTCATTTTCTTT 

GGCGGTGAAGAAGGAGAAGTGACCTGAAGAGGCGCTGGTCTCTTATTATTATTTrGTTGCAGTTTATTTCCTTCGGCAATTGCAGTGTTCTTGAGTTTCA 

AGAAATAGTCGATAATGTAAAACTCTTCACCCTCTCCCAGTTCTTCATACATCACATCACCAGATGTCCTGATGGCTTCTTCCACCATAGCCAAAGAAGG 

TGCTCCACATTCAATCAAATCCATCAGCAGTTTCTTCCCTCCTCTCCCCATTGCGACGAGTTTGTTGGCGTACCTGCACCAGGGACATTTTCTGTTCTGC 

GGCTTGTCCGGACCAAGAGTAGACATCATGTACTTGGATTTGCAGCAATCACATTCTACACATACAGCAACTGGAATATCAATATTGCCCTCATTCAATA 

GCTTGTAGAGAAGACAAGTCTTCCCATACAAAATCCCCACAGCCTTCGCTGCAGCTGAAACAGCAATTTCATTGTCTTTATCTGTTGGGGGAAGATTGAG 

AGCTCCAATGATATCACTAGCCATCTCCTTTACTTTGTCATCAATTTTCTTGGCATTCAAAAGATTGAGGGTTGTGGGTGATGCAG'TTTTGGCACGAGGA 

TCTCTGGTCAATTTATTGTCCCAGTGATAGACCATTGGTTTGCCCATCCTGTAGTTTCCCTTGCATCTCAAGCACAATTTTTCATCATTATCATATTCCA 

TCTTTAGAGATTTTACACCACATCTTTTGCATATATATTTCAAGTCCTTCTCTGATAGGTCAAGTCGACGTACAGGGAACGTCTCATATTTCTCCCCAAT 

CTTTGAGTCGCCAGCCTTTGTCAAGATTCTCAGAGCCAAAAAGTGTTCTAGTTTATTGGTAAAGTTCTTGTCAACTCCTATCTTATTGGCCCATGTAATT 

AAGCACCTCAAAAATACTACACCGGAAAGATTATTCrCGATCTCTCCATCGGCAGTTCCTTCAAGAATAGAGGGCAAAATAATATCCAAATCAGTCTCGG 

AACTGCTATTCTGCTCTTGGTAATACTTTGTGAATGCCATTTTTGCAATAGAGAGGATCATGTTTGAAGTGTTGGAGGGTACTTCACAGGAGGCAGCCCA 

AGAGGCGGCTGCACACACAGATTTGATTGCTGCATAACCTCCTTCTGGCAGACTCAGACGTACATCATACAAAGTTCCCAATTTTTGAATCACAGTCTCC 

AAGTCGATAAACATCATTTCTGCCACATCTTCTCGTCCTCCAAATAGATCGCTGGCCAATTCTTCTCGAAGCACTTCCTTCTGGTCATCTGATCCACAAA 

AGACTGGGATGGCAAAATATCTGTTGGAGGTTTTTGACGTCTCCAAAAGCACATCTTCAACATTTCTAGTAATCTTTAGACATGATACCAATTTC 

GAAATCAAAGCACTGTCCAAACTCGAACCTCTTGTATTCGTCCAGTGATAATAATCCTCCTCCTGTATATGAATTGGAGAAACTAAATGACTGGTGGAAG 

ATTTTGCCAGCTTCTTCCAAGGCATCATTGGGCACATAGAACCACATAAACATGGAGGATTGTTCAGGTGTGAGATCTGTATTGTTAATCACTTGGAGTG 

CATCATAGCTAGCAGATTTCATGTGCTTTCCTTCTTTTAGGAAGGCATATTCTGCCTTGTTTCCGCATTCAGTTCCCAGATCAGAAAGGATGGTAGAGAC 

CCTCGAGCACACTCCTGCATTATTGCCTGTAACAAAGGCATAAATAGCCCCAATTTCATCAGGCACAGCACTGCAAGCGTACATCATTGATTGTTCCTTG 

ATCATCATTTTGAAGAGTTTTTCAGCAAACATCTTGGCCCAGTGTACGCGAGCATACTTGCGAGACAACTTG<5TTTTTCTTCCTTGATCAGATGTAGCTT 

CAACACTTGCCATCAATTTTGTACAACCCTTACAATCACAGAGGTTTGTATGGTCAATTCCAAAATCGTCCAACGACTGGAAACACGCGTTCATGTGCAG 

TGCCAGAGTAGAGGCGGCATTTCCTCCCCCTGTAATTGACTGGGCCCTTTGAGTCACTACAGCCAGTTTGCTGATACCACCACCACCGCTAGAAGACTGT 

TGTTGTTGTTGTTGCTGCTGTGGATTAAAGGTTTGCTTAGAGCAACAAAAGGCACTATCATATGCACAATTCAAGAACACGCTACGCTTGTGGTTGTTAT 

ATCTTCCTCCTCCCTGTACAACTCTTCCAAAATCTAAAGGTGCCATATTTAAAACCCTAGACTTGAAAGATTCACGTTGAGCGTCGCCATACTTTTCAAA 

TCCTGCAGGCTCTGATCTTCCTTCACTTTCAATAAAAACATTCAATTCTTCTTGGTTGATTTCCCTCAAGATGGGGAAGGCACTAAATTGATCATTCAGG 

TTCTTGAAGAGGGAAATAGATTGCCTCATAATAAAGGTTGCGATATTCTTGCGCACATTGCACGATCTTCTGTCAAAGGATTCAAAAAGTTGCCATCTGT 

TGTTGTCGTATCCAGAAAAATTGATCACATCTTCTTGGCACAATTCACTCTGGTTACTGCCACTACATCTAGAGAAATTGAGGATGCGTAGCCAACAATC 

ATTCTCCACTAGCGCCTTCCTCATGTCAAGAATAGTCAAGGGTTTTTGTCCCCAAGrCTTCATGATGAATTCTTCTTCTGGTGCAGTTTCCGAATCGTAC 

GAGTTGGCACATAATTTTCCAGTGTCACTATTGGCAATACGCACATCCTCATCGGTGAACACAAAAATGAGGATGGGGAATTTTTCATCCCTGGTAATGC 

CATGCATTTTCTTGAATGCAGGAATCATGTTGTTAAACACGTACATCAGGGAAGAATGAAGAGCTTCACTGCCACCGCCGCCGCCTCCGCACACTTCGGT 

CTTGCGCATATGCTTCACAAAATCTCCCCCTTCCCACTCTGAACAGTCTGGTGCACGCACAACAGGTCCACTTTCAATGGAGTGTTTGTCGAAATCATTG 

TAGTAAACTATAGCATTTTGAATGCCTGGAGCAATCACATTGAGTGTAGCCATGATGCCAGAACAACTATCAGCCACAGCCGATTTCACACTTCCCATTG 

AACC ACT AAT ATCCATC AAC G AG AT AC TT AGACC AGT AGGTTTTGC AGT TGG AGT ATC T GT AT C AGC AGC AGCAGCTTTCTT AGTC CATC C ATCTTCT AA 

AAGTTC AATC TTGGAGGAAGGTGC ATT TTC CATC A AAG AGTT AAAAAAAG ATGC AGAT T TC AC AGT AG AACT AGAACTCT TG CTC ATGG TGG AAG AAGCG 

CTTGAACGGATACGGCCCAACATTGTAGTATATCTTTATGGTAGAAGTTGATGTTGACGGTAGGAAACGATCTCGACAACTGTATGCTGAGGTGCCTCCG 

CAGGCCGCTTTTATACCCCTTTCGTGTGACGTCTiArTGACGCTCGTACGTATAAACAGGGTCTATTTTTAGAGATGACATTTTTCAAAAACGTTTATATA 

AGCGGTCCACCCTTCGAACTTGACATCAGGCCGACCCAGCGGTCCACCCCCTAAACTGGAGTGAGCTGAAAAATTTTTCAAAAGTTTTTGAGATGAAGAA 
GAGGGTGAAAAGTX^TACGTACTATACACTCAGAGGGTGGCAGATCGGCCCTGTTCCAGAAATGGCTGTCCAGAAATCTGGGTCGGACAGATTCCAGAA 
ACGTTTCTAACCATTTCTGGAACAGTCATTTCTGGAAAGGGTTACAATTTCTTATAACTGGTGCATTATTTCTAGCATATATTCTAGCCCCTTTGTGCAA 
TCTGACATTGGGTCGACCCAGCGGTCCACCCTTCGAACTTGACATCAGGCCGACCCAGCGGTCCACCCCCTAAACTGGAGTGAGCTCAAAAAATTTTTGA
AAAGTTTTTGAAACGAGGATGAGGGTGAAAACACCTGTTAGAAAGTGGTTGCTGGTCGGTAGCATCCGTTCACTGCTGTTCCAGAAATGGCTGTCCAGAA 
ATCAGATTCCAGAAACGTCTATAACCATTTCTGGAACAGTCATTTCTGGAAAGGGTTACAATTTCTTAGATGTGGTACAATAACTTACCATACATTTCTG 
GTATCTTCGTGCAATCTGACATTGGGCCGACCCAGCGGTCCACCCTTCGAACTTGACATCAGGCCGACCCAGCGGTCCACCCTCTAAACTGGAGTGAGCT 
C AAAAAATTTTTC AAAAGTT TT TG AAT CG AGG ATG AG GGTG AAAACCTGGT AAC AG AAGGCATG T CT ACG ACGGCTTCTGGT T AGATCG AGCCGGTC C AG 
AAATGGCTGTCCAGAAATCTGGGTCGGCCAGATTCCAGAAACGATTCTAACCATTTCTGGAACAGTCATTTCTGGAAAGAGTTACAATTTCTTAGATTTG 
GTACAATAACTTACCATACATTTCTGGTATCTTCGTGCAATCTGACATTGGGTCGACCCAGCGGTCCACCCTTCGAACTTGACATCAGGCCGACCCAGCG 
GTCCACCCCCTAAACTGGAGTGAGCTGAAAAAATTTTTCAAAAGTTTTTGAATCGAGTAAGAGGGTGAAAACACCCTCTTGAAAGGCACGATCTGGATGG 
TGTCGTCCGTTCACTGCTGTTCCAGAAATGGCTGTCCAGAAATCTGGGTCGGCCAGATTCCAGAAACGTTTCTAACCATTTCTGGAACAGTCATTTCTGG 
AAAGAGTTACAATTTCrTAGATTTGGTACAATAACTTACCATACATTTCTGGCACCTTCGTGCAATCTGACATTGGGTCGACCCAGCGGTCCACCCTTCG 
AACTTGACATCAGGCCGACCCAGCGGTCCACCCTCTAAACTGGAGTGAGCTCAAAAAATTTTTCAAAAGTTTTTGAATCGAGAAAGAGGGTGAAAACCTG 
GTAGGAGAAGGTACGCACGCATACTGGCGGCACATGCCTCCAGAAATGGCTGTTCCAGAAATCTAGGTCGACCAGAATCCAGAAACGTTTATAGCCGTTT 
CTGGAACAGCCATTTCTGGAAAGTGTTATATTTTCTCGCATCTTGTACATTAACTTGATTATGGTATAGTACACCTCTCTGCAATCTGACACTGGGTCGA 
CCCAGAGGTCCACCCTCGGAACTTGACATCAGGCCGACCCAGCGGTCCACCCTCTAAACTCGAGCGACCCTAAAAAATTTTTCAAAAGTTTTTGAGACGG 
AGGAAGAGGGTGAAAACCCTTGCAACAGAGGGGGTATAAAGGGAGGTACCCTCGCACACCTTAGACACACAACCTCATCACCCTCCGTCCAATCAACATC 
ATCACCCCATCTCTAAATAATCCATCATGTATATCTTCGTCGAAGGTTCCCCCCTCACAGGGAAGAGTTCATGGATGTCCAAGTTGATAGATACAGGATC 
ATGTGGAATGTCTTTCCTCAATTTTCTTCGTATGAACACTTCTGACTACTACAACTGGCCTGCCGAAATCGGGACAGAACATCTCCAGTTAGGTTTCAGA 
GAAACCAGAGTGGTGGATGGAATGTTTGAACCTGTCCTAAAGACCTTTGTCGACTCGTGGAAGAAAGAGCAAGGAAAAGAGAGTTTGAAGGAATATCTGG 
ACTACAACGGCCAAGTCATGGAGATCTACATCGCAGAATGGTTGAGACAAAGGCCACTAGCCTTCCACGTGTTTACCTATACAGATGAAGCTGTCAAGAG 
TGGATTCTTGAACGAGGAGGATCTAGATATGGATACTGCAACCAAGTGGATGGCTGAAATTATTAGAGAGAAGAGGGGCAATATTCAAGAAATAAAAGTG 
ACCCCTAGAGTAGTCTrCAATGGCAATGTTTGTAGTGCATGTTTCTCTAACACTAAGAGAAACTTGTATAACTTTGGAACAAACTATAACAATGTTGTAC 
ATTGTGATTTGTTGTGCCCTTTTGCAAGGCATAG<3ATTGTACATTTCTTATAAAAAATAAAACAATATATAAAATTCAGTTGTATTTTTATTGCTCAAAT 
AAGTTACTACACCCAArTCTCCCCCCTCTCTAGTGAGAGATCCCAATCACTTTCTACTGTCACTGTTGCTGCTGTTTGTTGTTCCTCTTCTTCCTCCTCT 
TCTTCCTCTTCATCTTCATCTTCTTTAGTTCGTTCTTCATTATATGCTTTAATTArCCTCAATACATTTAAAATGGTGCGCTCTTCTCGGTTAAACTTCT 
GATTCTTTCCCTCACCACCAACATCGACCCTACGTCCATCCTGGTATACTGTTTCACCATCACCAGAAAAAATTCTAATAGACGGATTTTTTATAGAACG 
TAC AAC AATGTCC ACT AAT G ATTCTGGTTC TGGTGCT AACTG ATC ATCGCCAAT AAAAG ATAAAAC AACTCCAGAAACAC AT GGAT AAT ATTGC AC ATT G 
AATTGATCATATAAAGTATTGAAGATGATAGGGTAAGATGTTGGCCCGTAGGCAGACATGATCAAACGGGGCCTGAAACCACCCTCATCTCTCATTGTAC 
AATAATTTGAACACATGAATCTGATTCCTCTTTCTGTACCAGAATATAGGGGTGTGTCTTCATTGAATCTGTTTGTGAGGAAAAAGTCTCGTGCAGATAA 
TGCAAACACGAGACACGTACTTTTGTCTCCTATTCTTCCTATTTCGTTTCCATATTCTTCCCTTGCTTCTTCATCGATAGCATCCAATTTGTTGGTTTCC 
TCTAGTTTTTCTTCTAGGTATTTCCTCCTTGCGGCGTCAAGGTATACCTCGTTTGGCATATGGATGACCTTATTACGTTCATGCTTACCGTCCATTTTTT 
CGGCCGAATGTGTAACAACTGCATTCAAAAATAATGCCATGCATTGATCAAAATTTTTAAATGTGGGGT^ 

ACTCACTGGAACAACTAAGCATTGGTCATGACAACTACCCACTATAGGATTTTCTATGCGGAGGCCTTTGTAGGATAATCGATGTGCTATATCGTGTAAA 

GGT ACCATTGATTCGCCCC CT AAAAT AC T ACCCAT AC TT AAC AGTTT CTTTTCG ACTTTGTTGTTGGTT ATCCCAC AACCC ATC AAT AAATC AGC AGT T A 

CAATTATGTACTTGCAGTTGTGAATTGCTGCATTGAATAGTCTATCAGTGAGAATTCTATATAGAGGGGAGAAAGTGAAAGACTTGTCATGGGGTAAAGC 

TTCTCTTTTCACTATATTGTTGATATTCCTCAAATCTTCTGGTAGTCTTTTGAGCCAATCAGATTCAGATTCAGCACCTTTCCTTGTATAGAAATTCCTC 

ATTTGTGAATGATGTTGAAGAAATAATGACGTTTTTTCCAAGCAGGGGTTGCCTGCTGAGAGGCGTATTTTATTCACAATTTCATCCACCTCATGTTTTG 

GAGTATTTTTATGCAAAGAGACGATATATACACCTTCAGCGAGTTCGCTTCTCCGTATACTTTTGATAGACTCGATAATGGACGGTATTCTTCTATAGTT 

TTTAACGATTCCCTTGATAGCAAACCCGGACGCCATCTTTCATGACTCATTTTGGAGGGTGAGATCATGCAATGAAAAAAACTTATAAGTGTGTGTGTTA 

ACGCACGTAAATACCAGCCATTGACGTCATATCCGCCTAGTTTTTATCCTCAACTTCTGAGAATTGCTCAACGTACTCATCTCTTTTCCTGGTACATGAC 

GTCAATGGAAATGCCGATTGACGTCATGTTAGGGGTATAAAACGACTGGCTCGAGCCCTCGGATTCACACTTTGTCTGTCCTGAGCAGTAGAGAAGGACC 

CAGTGAAGAAAATACTCCTCAACTTCCAACAAGAATGGGCTCTGCTAACTTGAACGACCGAACTCTACTCGTGGAGGAAGGGCAAGTAAACGACTACGGG 

TCTACCAACAAGAAACAACCAGTGTGCTGTCAC^TTTGCTATCTAGGAAGAGGAGGAAGAATTGCATGTGTCATCGCATCTGTTCTGGGCTGCGTCCTCC 

TGCTGGTGACTGTCATGACTTTGTTGATTGTGGTACTGGGAACTGCACCAGTTAATTGTGATGTGAGCCCACAGAGCTACTCGCCGCCGCCGCAGCCGCC 

GGTGCAGTTTCATCCTTACCATTCTTCTTCCACAACCACCACTACTTCCACTACTACTACTACTACACCAACTCCACCAGATACTAAAAAAGTTGACGAC 

GACTATGATGACGACGTCAATATTGGAGGGCAATCAGTTACTGTGAATAATGGAGGTGTTTTCATCAATGGAAGAAAACTCTCAAAAGAAGAAGAAAAAG 

CAATGGGTATCAATACAGATAATGGAGGATTTGTTTGGAAGAATGGTGTTTTTTCTCAATATGGCAAAAAATAAAATATGGTATATTAAATATGTGTTGT 

TTTATTGTTTCTTTAAAAAATAAAAAAAAGAGTAACTCCGTCTACATAGTTTTTTTTAAATTGGAAGATATAAATAGAATGAGTAAAAGGTACTGTGCCA 

TTACATGTGGAGATCTTGTATTAGCAACATTAGAGAAATGGGGGACAATAAAGATTATGAAACJUiGACTGATTCAAAGAATTAATGAT^ 

ATTGAAAATAAAACTGAATTATGTGAAAAAATCAATGAGCAGATGAAAAATACACAACTAAAATATGATAA^ 

TCCGCJ^GATGGAGGAAAGAGTTAAATACCTCAAAGAGCAGGGAATCCCTCTAGACCCAGAAGAAAGACGTACAATGTTG^CTGAAATTGACAAGAGTAA 

CAAAGAGTTAGATGCCCTTCTTGAGGAAAATGAACGTATAATAAAGCTCATTGATGAAGAGTTGGAAAGTATGAAATAAAATAAAAGAGTAGGAGACTTT 

TCAGTTGGTGTTATAAATATATCCCTGCAAG^AGATACGGTCTCAAAAAATCCATGACTTGTCAGTAGCCACGGAAAACACACTTGAAAAATACACCAGA 

ATAGATGAGCAGACAGGAGAGATGCAAAAAAAATATGATAAATTATTTGAAGATGATAAAAGGTTCCGAGAAATAGAGGAACGAATCCTTCAACAAAAAG 

AGAAGGGAAACCCTCTAGACCCAGAAGAAAGACTTGTATTGTCGGCTGATATTGATAGGAGTATGAAAGAGATTGATGATTGTCTCGAGGAAATAAACCA 

TATAGAATTATCCATTGATACATTATTGGATGAATGTGAAAACTTGCATTATGGTCTTCAAACAACTAAATAACTCTTTATCGGGCAAGTGACTGTATTT 

GTACGTATGCi^TTAGGTCCATATTGGTTTGCAAATAAATAATCCACACAAAACGTGTTGTTTTTCTATATCCTATAAATTCTTGGAACGGAAGTTATGA 

CTCACATAGAGTGTATAAAAGGAATTGGATGTTCCTATGTCCGCATCACTTCTAACTGAGACATCGTTGGAAGAGGAACAGAGAAGAGAGGTTAACAACA 

ACACCAGTAAGGAAAATGATACAATGGAAAAAAAGACTGAGACGGCTGCAACAACAGAAAAAGACCCAGAACCGTCTGTCAGTAAAAGGTCCAGAAATAA 

AGAACCCAAAACAACTTCTACTGTTTACACTTCTGTAAAGTGTTACCTTTCTI^CCATAATCAAGAGTGAAAGTAGTAGAAGTAATGTCACCTCAACCAAA 

GAAAGGTTTGAGGAGAGGTGTAAATCCGTAAGCAAGATGATGGTCAAAGGTTCACTGTTTTTGAGGTTAGTAGTGGACGAGTGTCTGAGACGTTACAAC 

ATCTAGAAGACGAAATCGATAAATGGCCAGATATGACGAAGGATAACTTTTACGTCCAATTGT^ 

ATCTACACATCCTGTTGTAGAAGATGTTTGGAATTCCCCCATCGTCCAAGAAACATTCCTATCCCAGCAAGGAGAAGGAAATAATCCCATAAAGAGACAT 
CTCATGGATTTCAATACCATCACCTACGCCGCCAAACAACTAAAAACTTGCCT^ 

TATCTGGATGCTTAGCTGAAAACGGGTTCGATAAAMGTATACGAAACTCGTACAACACTGGATAATTGGATGTACCTACAAGAGTGATTGGGTGGACAG 

TGGTGATTTGGAAAGGGTAAAAGAAGGAACGAAAAATTTCGTGACTCTTCATAGGAAACATTTATGTGTTATTAGTGATAAGAAGAATGGTACAATTTCC 

TATTCACCTGAAGAGAAATATCCGATACCCTCAATACTAAATTATTACAAGTTTCTACAAACAGAGTATCCAC^AAACAAGAAAATACAGAAAATGATAG 

TTGTCCCAAAACACAAACTAAAGATACACTATTGTACGTTTGACCAAACGACCATTCAAGGAATTTGTAAAGATTTGGGAGTGTGGAAGGATATGGAAGA 

ACGACACAAACAATCAGAAGATATACTTTACAAGCAAGGATGGTACCTATTATTCGACGTTAAAAAGATTAAGAAATTGCGTCCAAACTGGAACTTTCAC 

TCTATCCAGACGGACGGCGAAGGCGTCTCTGTACTATTTTCCAGAGAAGTGGAAGAAGTAGAGACTGTTTCCAAGAAAAGTAAGAAAAATAAAAAACCTA 

GAGGAGATGAGGATAGGAGAAATTACCCGCCCACTAATGCCAAGTACGTAGTGGGTGTAGATCCGGGAAGAACTAATGTCGTTTCCTGTTCGGTATTTGA 

TACCCGTCAAAAAAGGGTAGTGAGAAAACACAGAATGACTGCGAAACAATACTATCAAGAATCTTGGATGACAGATAGAAGAAAGGCAAACGAAACGTAC 

AAGAAGAACAATAAAGAGTACAAAGAGGCGTTAGAGGAAATAACTAGGTACGATAATGGCGAAGAAATTATAAATGATGGTAACGGTGATACTTCTACAC 

CCACTAAAAAATTCGAAGCTTACTTGAAGGTAGTGAACGAGCACTACAGGTTACTGTGGAACGAAAAGGGAAAGAAAAAGTACAGGAAAAATGCCATGAA 

AGTATACTCTAGAAAACAAAAGTGCATATCTAACTTTATAGATGAATTAATCCCTAAAAGGGATAAAATTGAAGATTACCACATTGCTTTT 

AAATTTGCCTG<:ACGGGAAGAGGTGAGCAATACG<^TCACCTGCCAGGATTTTCGCCAAGAAGATAAAGGAAAGAGTCGGAGGTGATAAGAGGTTTACTT 

TCGTGGACGAGAAATATACGTCAAAAGTATGCCATC^CTGCAATCAACCTTTAAATATGCTGGAGAAGGATTGTTTTTCACCGAATAAGAAAAGAAAACC 

GCCGACAATAGTAACAACCACAACAACAACAACAACAGAAGAAGACGAAGAAAATGGAAAATGGAAGAAGGCTACACCTCTCAGAGAAAATAGAGATACC 

AGAAGATGCTCGTCCGAAAAGACGCAATTCGGTTACAGTTCAAACCGAAAAGTATCGACAGGAGATATCTCTATGGAAACGCCAGTACCTTCTTCCACTT 
CCTCTTCCTTTTGTACTCCTACTTCCATTACATGTGTCTTGGGAGGAAAATTCGTCGACAGGGACTTCAATGCAAGCACCAATATTGTTCATAAATTTCT
AGGGTTTTGGGATAAAAAGTTAATGGAAAAGAAAGACAAGATGCCGTTGAAGTATCACTTTATTCGAGTTGCCTGAGAATAGGATGTAAAAAACTCGTGT 
CACTAGTGAAATAGTGGCGTAGCCCTGAAGGATGAGAAGTATGCTATTTTCGACCTTACGGTCATTGTGGTATATCAACTCTTGCCCGATAAAGGGTTAT 
TCAACAACATAAAAATGGTACAATATTCTAGGTGTGTTTTACTATTCTTTGCTTGAAATCCCAAAAAG 

TCAACAAATGGGGATATTTCTGCCTTCACCTTTTGCACAACAGACTCTATGAATTTATTACACATCAAATTCAATTTGTAGGCTGTTATAGGTCCAATCT 

TCATATCGTTGTAGAATCTCGAACCCTCGACACGTGATTTTTCACGGGCAAGGAAAGCAATATCAGTCAGTATAGGGAATTCAAAAGTGACTT^ 

TATCTGCAAATCACTGAACGTGCGGTCTGTAATGGTCATTTCGTTGTGCCTGAGAAGTGATCTCGGACGGAGGCGGTTAGTCATGTTTTCCAGTTGAGGC 

AAAAGATATACCATTTCTTTATAAATTAGGCGAGAGGCTATTTCTTTCCTCGCTTCCGTTGCGGCTTTTTCTTCCTCGATTTTTATGTCTCGGACCATGG 

CTTCGGCAAGTCTTTCTCTCTCCATTCTGGCGACTTCGATTGATGAGGCGAGGATCTGCACATCACCATCAACATTTTCATCAATATAGACTTCATCTTC 

ATCGACTCTGGGGAGCTTACTCGGCGGGTGTGTTTCTACTGATTCTTCTTCTTCTTCTGTTCCTTCAGTGAGGTCCTCCGCCTTTCTCTTACTGGTCGTG 

GGAGAAACTTCTTTrGCTTTTGCTGCTGCTGCAGCTTTTGCTTCCATAGACCTGGTGGAAACCATTTTGAGGTTCTTGGATGATGTTGTATATGAACTTC 

TTAGTGTTCAAAGAGACTGAACAAGTGTGTGCTCAGGTGGGATCCTGCAGGCCTTATATACAATTTCAAGATGGGGAAGTGGGGGGAGAAAATAAGGTTG 

CCAACTAGTACCTTTGTCACCTACACCAGAAATCCACACATGCAGAAGGTTTGATGCTAGTTTTTGGATACATTTTCTGGGTCCAGTTGTGTGGGGGGGG 

GGTGGTATAAATATAGCCCACAGGAGGAGJUSCCTGTGTCATTCTGTGTGTATGTCT^ 

GTGGTATTGAAATTGTACATAAAAATGGTGCACCAAAAAGGTCTCATAAAACTCTTTACTTGT^ 

GATCGAAGAGTTACATCACAAAACAAATAAGCAATTTGAACAGGCTCAAAAGGTCCTCGACAAAAACG 

ATAATAATTCCTCTAGACCCAGAAGAAAGACGTGCAATATTGGCTGAAATCGATAAACACATGAAAGAGATTG 

TAGGTTTACTTGTAGATGCAGAAATCAATAACTTGGAAGAAAAGGAGGTTGAAGAGGAACATCTTTTGAAACAAAAAGAAGACTAAACAATAAAACATCT 
AAAAATACATTATATGATTTTTTGGATTAATAAAAACAAGTGGAATAATTTTTAGGTATA 

ATCTATTGTAACTTCAATAGAAGGGGTTATATCTTCCTTCACTTTCTGTACTACAGCCTGTACAAAATTATCACACTTGTGGAGCAACTGGTTAACTGTA 

AGAGATCCAATCTCCATGTCGCCAAACTCTTTTGGTGGGGGTGTATTACCCCACCTAGAAACGGCTTCACGGTTGATCAGGAACCCAATATCAGATAGTA 

CTCCAAAACTAAAATTGACAGTATTTTGTACCTCAAATTCATGCTCATAGTGTGAAACTGCCATGTCATGTTCCCTTAGGGGGTTTTTCAGTTTGATGTT 

TTTGGTAGTATTGTCCAGTTGCACCAAAAGATTTTCCATTTTCTCTGCAGTGTCGTCGTCCACTCTGTCTTCATATTCTTCCTCATCCTCCTC*TTCCTCA 

TATTCCTCTTCTTCACCATCGTCTTCTTCTTCTTTACATGGGAGGGAGAGGTGATGTGTTTCTTTGTGTTCTTCCTCCTTCTCTTCCTTAATTCTGGGGC 

GCTTGTTACTTGACGAGTCCTCTACCAGTTCTTCTTGTTCCATTTTTACTTCCACTTTCACCTTTACTTCTTCTGGCTCCCTCTTCCTCTTGGTGGGGAG 

AAGAGTAGAGGTGGCTGCTACTGCAGATGAAGATGTTGTTGATGTTCTAGAAGAGACCATTTTGGGGGTTATTTTTGGAACTCGTGGGTAGAACAAGTAG 

TAGTAGTATTAGACTCTTCATTGTTTGACTTCl^TGTTACTAGTTTGCGAAATACGGGCAATGTTTTATTATTTATAGATAATGTCACATATG 

CGCAGATAATGTTAGATTTGATGAAAATCCTGGATCTATAAAAAAGAAATCAAAACACTTAAACGAATTCGATTTAAAAATTTGCGAG^ 

AATAATAGTAGATTACCACAAAAATTTGTTAATGCTACAAATAGATTAATATACATAAAACCTCCATCTTTACACGATGTTTGTAAAGTACATGGTTATA 

TAAACCCACGGAAATTATTATTAGATGCAAAAGGGACCCAGGAAATCGTTTATATGTTCCGCTCATTTTTTGGACTACTTGTATATCACGGATGGGATGT 

CGATGGTTTAATGAATACTTTCGGTTTACTAGAATTTGTGACACAAAATTTCATTAACCAGTTAGAAATGTACTTAGATAAAGACCCAGGTGAAATATTG 

TATCATGCAGAATATCTTATGAATACTTTAAGGAATCTCAGAGAAATTGAAAAAGAGCCAATTAGGGATTTAGGTTGGATTAAGTACAGGGCTAATTATA 

GAGGTTTCAGTTAATAAAGGACTGTTTTGTTTTTTTTTATTTT^TGTCCCAAACCC^ 

GCCTTGTTCAAAAGGACTACATATTCATTCCCTTCATTTTCTTCTCTTTTCCTTTTCTCTCCTTTACTGCTGGTGCTGCCGTTTCCATCTCCATCTTCTG 
AAGTTTCTCCTCCACTTCCTCCATCTCCACGTTCTTCATTATACATGGAAATTGCCCT^ 

CCGTTCGTCTTCGGGCAAGTTTTCTTCCCTTTTGAGTAACTCTTTGTATTCCTTAGATGTCGGATCAATCACATTGATTTCGGCACCATTTTGATATACT 
TTTCCATTATCCATGAAAATTTTGATGTCTGTTCCTGTTACCTCTCTGACCATTTCCATTAGTTCCCTTTCTCCTGGAGGGACTGCAAAGGTTATACTAT 
TGCCACTGGTGCGGGAGAAAAGTCTCGTCTTGAAGTGGTTTCGTACACTGTCCCTAATACATATAGGGGCATCCGATCCAGTTATGGACATGATCAAACG 
GGGCCTACGTTTCTCCCACATCTCTCTTTTTATATTCAGGTGAAGAGACATACCTGATTCCTTCTTCTGTTGCAGAATAAAAGGAACCATTTCCGTTGGTT 
CTGCATAGCAAGGAGAAATCTCGGTCAGAAAAAGCAAACACTAAGCAGGCTCCACCATCATAGCGGGATGTCGCAATCGATGATTTTTTGTGCACATCCC 
TCAAAGAAATCCTTTGTTCTTCAGAGACGG^CAACGTTTCCTCAATTCCTTCAGTrCTTCTTCCTTCATATTTACTTCATTCTCTGCAAACTTTATTTC 
TCTTCTGAAATTATCCATCTGTAAATTATCCCTTTCATTTGCGTATATGACGGTTGCATTGAAAAGAGATGACATATAATTTCCAATACTTTTAGCTGAT 
GGGTGTGTACTGTTGTAGAAAATCAAACCTAGTTTATCCATTGGAACAATTAGGCACATGTTGGAGTAACTGCCTGTGATTGGGTTTGCAATGTAGGCCC 
CTTTGTATGTTAACCTATCACATATATCATACACTGATATGCGGGAAGAAAAGCCACATTGTTCCACCAATAATTGTTCCATGTTACGAAGTGTTCTAGG 
GCAATGGCTCGGCGrGTAGCCACACCCTACTAAAAAGTCAGTGGTTATTATCATATAGTCACATTTTTTCACAGCTGCATTAGAAAGTCTTTCGTGTAAA 
AGCCTGTAGAGAGGAGAAAATTTAAAGTTTTTGTCACTAAGAAGGGCCTTTTCTTTAACGATTTTGTTGATATTTTGCATATCAATGGGGAGTGATTCGT 
TCCACTTTTCCCCATAATCCTCACACCCATCATCTTGCAATAAACTCCCTAACCTGGGGAGATTGTGAGTATACTTTAATCCGACTTGAAGAGTATTGTC 
GCCTGTCAATTGTCTCAGTCTATGCACTCCGTTTTTTATGTGAAGTACTTCCTTATCTTTTAGACGGACTATATATAGTCCGTCCCTTAATTCACTTTTT 
GTAACCCTATCAACAGATTTGAATGATAATAGAATATGTTCGTATTTGTCAGGGTTTTCACCAGAAATAGTGAATCCATTTTTTACCTTTAAAGTCTTGA 
ATTCTTCCATTTTGATATACTTAAATGTGTCATACTCGGGTGTCACAGATAATAAACCTACAGACCTTGCCATTGAACTGTGACACAGAGTGGGCCGATA 
CGCAAATTGCCACGTTGCTTGAATTACTTAATCTGTAGATGAAATCGGCAAAAAATTACGTCTTTAG 

GTACACAAAATGTGGCACTTTTCAAGCCACCACCTCTACATCCCAGTGAGGCGGTGGAGGTGGACGTCATATTTAACTGGAATATGTGTAATAATATAGT 
CCAATTCTTAAAAAATCAATAGACCAATAAAACAACAAAAAAGATTTTGTAATTTATTTATTCAACTAAAATCTC 

CTTACATTTCATATCACAGTGTACAATACTATTCGTGCATTTACCAACACGGTACACGTAATTCTTACTCGTCGAGAAACATGAACTACAAATATTCCCT 
TTGTACACATTTCTGGGGATAACTTCAATCTTTACAAACAACGCTCTCTCCTCCTTGAGAACGTCAATAAACCAGTTATGTGCATCATTCAATTCTTG 
TAATTGGTTCCAl^TTCTTGACATCTTCTTCCCTGrAAACCATCACATGGAAAACTGGCTTGTATTTGACCCACTCGGCGACACATGCTTTGAATTCTTC 
TCCATACATCTCCAAArACTCCCTCAATCC^^TTTTTGATTCTGGAAACTGTTCTGTGTCGTGCCTCCACATGCCTGTGAAAAAGTTCAATGATGATCCA 
AACATTCCATCCACGAGCCTGGGACGTTCCTGGTAATCGGATGCACGCAAATGCTCTTGAATGGTCCATGGGAAGATGGGTAAATAGTCCCTATAATTGG
TGTACATGAAGTTGAGGAAAGACTGTTTGCCCTTTCCGGCAGTTCTCATGTTATCCACCCATGAAGTTTTCCCAGTTAGGGGAGATCCTTCGACGAATAC 
AATCATGTTGAAGGGTAAGACTTGTCTAACTCTCTCGAGCGAGCTCTCTCTCTTTTATAGAGGCAAAAATGTAGTTGCCAGCCACCACCTTAAAA 
GCATTAATAATATATAATTTTTATATTTTGTATCATTCAACCGTAGTCCATCTAAGAGTCAACATGTCCTCCTCCTCGTCGTCGTCGTTCTCGTTCCGCA 
TCTCCACCTACCAGACCTTTCTCAAGGCTCT^GCTCACCCAGATCTGGTGGACAAGATTACCCAGAAATGTGACGAGACAGGAAGAAACCAGAAGTGCCC 
C ATC C AGTTTCTGGCCG AC ATCTCGC ACCTGATC C AAGG AGAAAG AAATGGAGG AAATC TGT TC CCTTTGCACCCGTTC AAG AAC C AACC AC ATCTGG AA 
CCAAGAATAGTGGGAAGTCTTCACGGGAGAAC^TTGGACAATGACATTGAAGAATCATACTGTTATTTTGTCAAGGATCTGTATAATGGAGTATrraCCT 
ATGTGAACGGCGTCAAGGAGTTACAGGGCGTTCTGGACAAGAAAATATCTGGATCTGGATCTGGAGAATCCTCCTCTTCTAGAGCTCCCCTGATCCCAAT 
AACTGATGTGGATTTGCTGTACATTTTCG^TACTTTGGTAGTTCTTCCCCCCAGATCTAAAGCGTACCGAGTCATCACTGAAGCTGTTCTAGCACTXTCCC 
TTCAATGAATTCAGTAACAACTGGCCTCCTACAAATATCAAAGGAGCATACGTGTCTAGAGATTTCAGGATGTTTAATCTGTTGGCCGGTTTAGATCATA 
TAGAAGGAGAAGTTGGGGGAGAAAGTGAATG^GAATCCATACACGCATCTGTCGTCAAGCGAATGGTCACCATTATGCGC^ 

TC C ATC A ACATCT AG AATTTTT AG AGTGT ATCTGGCTGAACC AGT T AATGATGC AGTG AC AAAG ATC CCT AT ACGTGT ACTC AGT AAATT ATTCGCT A 
AGACTCGCCGGTATTCTCCAGAAAGTGTACTCTTATTCAATGCTAAATCTTCCATATCTCCTGTCTTCAAATTCTATAGACATCAAGCAAGGAGTAAAAG 
G AATT AC ATTATCT AT ACC ATCTGC AAG AAAATTGGG AT TCT AT TT ACT CC AAAAGG AT AC AAC ATT AC AATCTTCC CT ATC AC AAGAT GTTGCCGACTG 
CATAGTTTCAATCAACGCTGGTATTATTGGTGATGATTTCTCTGAAAAAATACGACAGTGCATTGAGGAGAAAAACAAGCCAGAAAACTGTTGTATGTGC 
TTTTGTGAAATTGACAAGACGCCCGATTTTTCTTATAGTGAACATGTGGCAAGGCACAATTTCTTCCCCGTCCACGCATTCTCCTCATCACATGATGACA 
AGTGTTGTGGAGCAAAGATTTGTTCCGAATGTATATTCCCGTACATCATTTCCCTGTATGAGAAAATGACTGGTGTGGCAGGTGTAAAAGTTGTAGATTT 
GTTCCAGTGCCCTGGTTGTAAAAGTGGCATGCTCAATCTAAAGGGAAGATGTTACGAGTTTTCAAATTTGTGTAAGAGAATGATACTACCATACACATCG 
ACTCATTGTTCTTCTCTCTTTGATGCTACCATAAATCGCGCAGAGGCTTGTTTTTACTCCCTAGAGTTTCT^ 

TTGCGCATGGAGCTAAAGACATTCCCCATGTTTACAATAAGGTAGTAAAGAATGTAAAAGATTTGGATAGACTGTGTGCCTTGTACTGTTACAAATGCGT 
ATCTCCTGTCGTATGTGATGAGCCGAATGAAAGTACGGACTATGAAATGGTGGATGTAACTCCCCCTCTAATTAATCTTACCGAGATTGTTGATTCGGAA 
GAGTATGATGATGGTCCCGGAAATCATATGTGGCCAGCAAAATTTACCTGTAACTTTATTGCGGGTTCCAGTGGAGAAACACCCACCATTAGTACGTGCA
GAGATGCTGTAACTTTTCTAGGAAGAGCACCGAGGAAGAAAATGGCAGGATGGGATGATCAATCGGCAGTGGGGCAAGCCATTATAGCGCTAGCCAACTG 
GAGAAAGAGTGGGGAATTGCCCAAAAATATGTTTGATTTACTAGAAGGGGTAAATGCCGTACTTTATAGAG<K:GACAGTTTCTTGTTACGTGCGATAAAC 
TACCCCTGTGTTATTGGTAGATCCATGAGCCCTAGTCTGGAACTCGTTAAAAGAAAGGTGAATAAAATTGCTTTAATAAAGGCCTTCTTCCACGAGAAAA 
GGGTGCGTCCAGACGCATCTAAAAAGTTACTTGAATGGGCAGAACTATTAGTCAAAAGTTATCTCATGGAAGTTTTACTTCAGACGCCAGAATGTGTCAT 
ACACCGCGCCCATTCATTTGTAGGCAAAACTCTCCTCATTACTGACGAATTGCTTCACATGCGTC^ 

CTAAATGCGGCCAGACAGAATGCGGCTGCTGCAGCCTCATTTTCTGGGTCGCTCCCCAAACCTGAATTTGTCCCCTGCAAAGAAAGGACGATTGAATGGA 
TGTATGAAAAGGACAATGATGATGTTAGAGTTGTAAATTGTCCTTCATGTAAAAAGGCTATCCAGAAATATGGAGGTTGTGTGAATGTGTTTTGTGAATG 
TGGAACAAACATGTGCTGGATATGTGAAGAGAAGGTTTCTCCTGCTGATTCTAATCATTGTGTGGAGAAACACAGGATTGTTTATAGTAACTGTGTTAGG 
GTTAAATATGCCTTAGAAAGTATGTACGGGTTTCAGATTTGTACCATGAAAAATGTAGAAGAAGGAGTTAAAAATTATTATGTAATGGAGAATGGATTTT 
TCTTTGATGTACAAGAAATGGTTGCTAAGAAATAATAAAAC^CTTGTGAAATGTTATAACTTGTT^ 

TT GC ATGC C TTGTTC AAAAGGGC AAC AT AATCGCC TTCTTTCTCC TCCC TCTTTC TC TTTGTTCCTCC ACTGCTTGTGGCT ACTTCTTCTTCTTCCC CAC 
GTTCCTCGTTGTACGTCTCAATTGCCTTAACTGTGTTGAAAATCATGCGACGCTCTCGGCGCAGACGCTTCCTTTCCTCCTCTGGTAATGTTTCTTCCTT 
TTTTATTATGTCCTTGCCTTCTTTAGAAGAGGGGTCAATCACTTTTATTTCTACACCATCTTGATATACTTTTCCATTATCCATGAAGATTTTGATGTCT 
GATCCAGTAACCTCTCTGACCATCTCCATCAACAACCTTTCATTCGGAGGATCGATGAATGATATTTCATTACCCTTTCCGGATGCAATGAAATGGTTAT 
TAAAATGGTTTCGTATACTATCTCTGATGCATATAGGTGCATCTGAGCCAGTTATGGACATGACCAATCTGGGCCTACGTTCATCCACGTCCCTCTTTCT 
GTAGTCGTCCGAAGAGACGTATCTGATTCCTTCTTCTGTGGCAGAGTAAAAGGAACCATTTCCATTGGT^ 

AAGGCAAAGACCAGACAGGCACCGCCGTCATATCTGGATGATGCAATTGAGGACTTTTTATGCACATCTCGAAGAGAAATCCTTTGTTCTTCTGAGGCAG 
CATATAGCTTTTTGGATTCTTTAAGTTTTTCTTCCAGAACACTTCTTTCGCTAACTAGAAGGTTGACTTCATTTTTTGCAAATCTGATTTCCTTTTCGAA 
ATTATCTAATTTTTGATTACTCTTTTCTAGCGTGTATACAACTGCAGCATTGAAAAGGCATGACATGTAATTTCCAATGCTTTTAGCTGACGGGTGTGTG
CTGTTGTAGAAAAGTAATCCAAGTTTATCCATTGGTACAATTAGGCACATATTTGAAAAACTGCCAACTATGGGGTTTGCGATACTTAAGCCATTGTACG 
TTAACCTATCACAGATATCATACACTGATACACGAGAAGAGGTACCACCGTGTTGCACTAACACTTGTTCCATATTCTTAAGAGTACGGGTACAATTTCT 
AGGAGAAAACCCACACCCTACTAAGAAGTCTGTTGTGATTATCATATAATCACATTTCTTAACGGCCGCATTTGAAAGACGTTCATGTATGATTCTGTAT 
AGGGGTGAAAATTTAAAGTTCTTGTCATTAACTAGAGCCTTTTCTTCCACAATTTTGTTCATTTCCTGGAATGGGCTTGGTAGTGATTCGTTCCATTTTT
TTCCATAATTAGGGTCTTCATCTTCTAAGAAACTACCTTGTTTTGGGAGACTGGATGTGrATTTTAGTCCAATCTGAAGCGTAGGATCTCCTGTTAATTG 
TCTCAATCTTTCGTTACCGTTCTTGATGTGGAGTACTTCCTTGTCTCTAAGACGAACTATAAAATGTCCTTCCTTTAGCTCACTTCTCCTGACACrCTTG 
ACTGATTTGAATGTTAATAGTACCCGTTCATAATTTTCGGTATTTTCCCCAGACAAACAAATACCATTTTTTACTTTTAATTGTTTAAAGTCTTCCATTT 
TTATACCGTGTAACCCTAACAATGGTGTGCC^GTATAATAAATCTACGGACCATGCTCTTGATCTGGGAAGTGTACTCGTGATTTCTGTAGAAATTGCTA 
TTTCCTATTCATTATTTCCATTACAGGAAAAGATGATAAAATGAAGACGTGCGATGAGATATGTGTTTTTTCAATATCGTATATCAAATTTGGGCAAATT 
ACATATTTTTTAGCGTCAACACTCATCTTCATGCAAGACTTTTTGGAGTGAAATTACTAAAAAAGGCTATATTTCATGTACAACATAAAACACATGTGCA 
GACATTAAATTGGTTTGCTCACCCCCCCAACATCGTTTTCATCAACTCTTTCCTCACCTCTATATAAAACCAGCCTGCACCCCCTCGGAACCCACAGTCG 
ACTTGTACCCTCCGTCGAGATGACATCTCCAGCTCCATCACCCTCTTCCACCCCCAAATCCAGTTGTACCACTATTGTAAACCGATGTGGTTTCCTCCTT 
GACAACAACAAGGAAGTGGTCATCTACGACACCMTTCCAAATTCAAGTGTGAACCCAAAAATCTGGAACTAATTGGTGTACTTTCTGGAGTCTCTGATA 
ATGTTGTTACCC^GATATCCCCCGACCAGATATTTGTGGGAACATATATGGTCAAATATAACTGGTCTAAATCTGGTCATGAACGCTTCAGTGACATGAG 
TAACAACTGTCTGGACAATATTACACGCCCTTCAGAAGTGATTGAAAGTGTGATAAAGAAAACGTCCAGCGACT^ 

ATGGACCACACCGAGAAATACTATTTTTCTG^TGACCAAAAATTGAGCAAAATTAGTAGTTGGTGTACAACCCCTATACGACAGTGGGTATGCAACTCCG 

TCTAGAAAACTTTGTCAAGGAAGAACATGAAACTGTCGTGGTGCACAACCCTTCTGGAATGACTGGATTCAACATATTTAATAGTTCCCCCGTGTATTTT 

GAAGTGCACAATGAGATGGACGCCCTAATTTTTATGGCGGCTTTCTTGAAGCACAATAGTTTATGGGGAGAAATTAACGCCAATATGGACTTGTACACGT 

TTGATTATGCGGGTGCTTTTCTGGACGAAAGATGGTGCCACCACGAGAAGAGTTTTTCTGTCGTCCGAGCACAACTTATCAACTCGTATTACAAGTGCAG 

GAGAAAAATCATGCAAGCCCTGGACAATAACTACAACAACAAGAATAAGAAGAGGAAGAATGTTGGTGGAGCACCTGCGTTCACATTTATGAGCGGGGAC 

GGAGAGGGAGGAAAGGAAGCCCTAGAAGCTAGTTTCGATGTGATTGGGGGAACAAGAGGAGGAAGATTTGGTGTTGATTCAACACCATGCCCCCATTCTT 

CAGCCATGCAACTAAAACTGGACAATGAAGGAAACTATGGATGTATTGCCTGCTTCGCATCJ^TGTTCTTTGTATTGGAGAACCCAGGTGA 

CTTCATATCAACGGATGCCTCTAAAATTGGACAAGCGCAAGCATGGATAGATGAACGACTACGAAACAATGAAAATGGAGGAGAAGAAAATAATGTCTTT 

AAAAAGACCTTCCATATGCTGGCTGATATTACCCAAAAGGCTCATGAAACTGCCTATTCCAATACCATCCCACTTGGACCCAATGGCAGGCAGTGGAATT 

GGCCTACTCACACTGTGGAACCTATTGCCCATGAATTTGTTACCCATTCTCTAGTAAACACATTGAAAAATCTAGGGGATAGAAAACTTCCCCGATTCAA 

TTTTGATATCTTGTACAACTTGCTTAATCCATTTGGAAAAATGTTGCTAGTGTTT^ 

GTGGTGCCTCGAGGTTCTGCTTCTGGGAAGTGGTGGACTATTAATTTTGTGGGTGTGAACATGTGGACTTTTC^GTAACAAAATGTAAAGTTGAAAAGG 

ATAGAAAAATATCCGATTTGGCCTGTATGGAAACTCTCCCTCGTCTACCTAATCCAGGAAGCACTACCGTCGATGACAGAATAGTTTTTAAGGGATTCTG 

TAGAGGGGAAAATCTAGGGAGTGTAGGTGAAGTCGTATCCGACATTACACAGAGTGTCAAGAATTTTTGTCTCATGGTTGAAAATAGGAAATTTAGTGTG 

GATAAAGAAACTGGTTTCATCTCTTCAGAATCGATAGTGTCTGATCCCTTCTTTTCACTAGAAGTGACTGGCTGTAGATCTAATCGTGCCCAAGATACTA 

TTAATAATGGCCGAGTTAGTGCTCGTGTAATGAGGATCCTAAAGTCACGTGAAGGTGCTCGTGTATGGTTGGCCAAGGATGAAAATGCCATCATCTTTGA 

AAACGTTAACCACGATACGGCCATCTCTACGGACGCTATGGAGCGAGCTATAGGGCAGCACAAGATACTGTACTATGATATTGAAACAACAGATAAAGAT 

TTCACCGACAAAAAATCAGTCATCACATCTATTGGGTTCTGTTTGTGTACGGGAGGCGATATGACACATGGAGGAGAGAGAGGAGTATTTGGACTGGTTG 

C ACC TGG AT CCGACGTGG AAAAGGTG AAAG AGACT AT AAT AAAT T CGTAC GATCCTG AAG AAAAGG AAG AC ATT ATG AAACAGTGCC CT C AAGTGATTG A 

AATTTTC AC C AACGAGT TTG AAAT GTTGC TTGGTTTTGG AAAGT AT AT AGAT AAAGT G AAGC CTC AC GT G ATT AGTGGGTGG AAC AATGT AGCTTTTGAC 

GACCCCTTTGTCTTTACTCGTATCGTCAAACATTTGAGTGATCACACCAAAGACATGTCTTATTGTGTAGCAGATGCATCTACAGCAG 

CTAGAGCAACAGAAGGAGGAGGAGGAGGAGAAACTCCATATAGATTGAGCACCCCTCAAGAAAGAATACAACTAGCAAGCACTGGTATTTTCAATAAATT 

GGGAAAATTTGTAGACAAGAAAACTGGCATGTTGAAACCTGAAATGACTGCAGATTTATTG^ 

CGCAACAAGTTATCCTCCAGTAATAAAGGATCAGCAGGATGGTTCCAGAAAATTATTGGCGGTATGTGCAGTGCTATTCGGTTGGATCTCATGAAAGTGT 
GCGAAAAGGCCTATAAAGAATCCCTCTCTGAATTTAATTTGAACGCCGTGCTCGCCAAAGTGAGTAGTGTCGGCGACAAGGTTAAAAATGTAAAA 
AGTAGACCTACACTTTCATCTATTGGGATTCTTGAAGCTGAAGAAGGCCCAGGATCAGGCAAAAGTACACGTCTATTGTTGCAAGGATGCCTACTTGACT 
GGTATAGTTTCTACCTCCATCAACAAGGAAGGGGAGATTTTTAGGCTGTGTATGGACTCTGCTTTAACCGAGGCGGTCGTGACAGCCAACCTGGCCACTC 

ctctatgtataggagaaggagcaatctgtagaaatatgggagaagaaagggcagatagaagaggtgtgggagtaagaagacactctattgccacagacac 

aaagggaggtatggtgagtcaacctatcgtcaatcatgttccctatcaaacgattgacatgac^gtttgtacccgatgaccatgtgtcagaataa 

tgcaccactacctttgtgacccatcgacaaattatgcaactgagggatagattggtacttgaaaaaatgaaaaacaaaaccaccgactctttattgttgt 

tggacgttattgacgagtgcaatcagattgtgttgtccgagtacagacccattgatattgcagtcgcatcatggaagaatagcaactctaatagacaaac 

tccaattactcgcatagaggaaagtttgggtctaagattcatagaaaatttggatgccgagaagacaaataataaaacgtggtgcaccaatacatctccc 

aatatgaatgtcactgccgcaggtatggattacttccccgagattgtgtgtgacattaatatgcagtttgcggccaaggtgaatgatgatatgcatatag 

ccccagcaagtttagagtatatgcttcaagtattgcccataatgttaatcgacagaccgtacattggcgcacacataacagctggaaaatgtcgtacatt 

ggagg at attctctc ag aac tcg aaaagg acttttctgttgaaaaag atg aggaaat tat aag aac c c at tgg ac attt aagggtc aaaaac aat acgat 

ttct gtc at agtcctgtg ac c c aaatggctcgtc ac att att gaatc tac gggaagaaat atccgtg att at gaaggc aatg aaaaatt c gag aggttgg 

ttagcttatcagacagaatttatcgccg<:gttggcgcatttgattcgg<:caatgatccagctgttagactgtggtcttctcgtctaatcaatgttcgaat 

gttggttaggacatggaacgtaaaaactgacattcttaagggaatcatccctcaaatgcaagccacttacagagccgatcgagttgtgatgcagaacaag 

gccaaggagtttgccaagatgggagacatgaaaagagctggtctaaacaaagttggacaaaatattatgaagctcggtatgaattccatgtatggtc^ 

tggccctaagagcacgttcgagccgtaaagagtttgcgtctggatctgccaatactgcctcaagtatttccaacatgtcagccaccggaggaattggagg 

aggcacaaggcactcggtgacggccaatcaga1tacagaaaatgctcgatgtgtatttggcaatattggttgtggattacagatggctcttcctggtact 

aagcagacgtacggggatacagattctgtattctgcgtgcataatattgtaggtgatggaggaatgataccagaatatgatgaacaaactggcaaatatt 

ATTATGTGATGGATATTGCTCTAAAAAATAAAATGGCTGCAATTATTCCCATCCTAGTCAACTCGTTAACAAAGGGCATCCAGTTTGTAGAGCGCCGAGA
CGCTGGTGTGGGCATGATGAATATCGCCCATGAACGTCTAGCTGTCGCTGGTCTTTTGTTTGCCAAGAAAACATACCATATGCTTCACTTTAATGAAAAT
AGTGCAGCGTTCAATGACATGATAAAATTGAAATCAACCGATAACAATAATAAGTTTGCATCCTTTATCAAGAGACCTAGCCACGCAGATGGGTATGTTG 
TTCCCCATAATCCTTCATTGATTCTTAGAGCGGCCGAAGGACCCGCTGCTAAGAAATTGAAAAGCTTTTTGGAAGAGGAAGGAATTCATGACGAGAAGAG 
TATGGAGGAATGGrTTACCTCTTCACCTACGTGGATGGCCATGGATGCTTCTGTTATCAACAACTTGTATGCCTCACAAATTGTAGGGGTGGAGAAGGGT 
AACTGGATTGACGCCATGACTTCCCGCCCCATAGAAGCGGGTACAGAAATGATGGAGGCGGTGACGCAAGCGAATGCAGCTTTCACCCCTTACAAAAAGG 
G AGCCTTTGTGAAG AAGGG AAT T AC AC C C ACC ACC AAAC T AAAGGGTCTCC AATC AT TGATTGC AAGATTTTTACC AAAAAT AG AGG AAAAG AAAAGC T G 
TTATTTGGATGTGATGAAGAATCATGTGGAGAATTTTGCATCCCATATAACAAATCCTGCTATGATGATTACTAGTTCCCGAGTCAACAAGTTTGATACG 
TCAAAAGAACAGAGTAGACCTAATCCTCTAGCTCTAGCGATAAATAACCACCTGAACCCTTCTTCAGAAATTTCATTGGGGCAGAAATTTAAGACGGTGA 
CATCAGTTTCTTCTTGGAGTCTTTCGGCAGAGGAAGGGGAAGTCCCTGCTGGTTATTTTAACGCTGGTAGCGTGCGTTGGGATGCCACCAACATGAAGGG 
AAGTGTTCCTGCATTTTCAGTCAAGAATTTATCTGTTGTGCCCAACGCCATCACATCTGTATACAAGATGGTTGAGAGCGATAAGACGGCAATAAAATCC 
ATGATTGCTAAAAATGTAGAAGTGTTGTGTTCTACATCTGCCAATACTGGATTTTCTCTGAGAAGAGGAGCATTGTCATTTAATACAGGCGTCATTGTTA 
CAAAGGACGTGGCTATGGCTTGTATACGATCTCTAAATAATAAACAAATGTTATTGTTTGTTGGAGGGGGAAAGGATTACGGTGAAGACGACGACGACGA 
CGACGAAGAAGCAGAAGAAGAGGACGAAGAAAATGGTGAAAACGAAGAGAACAAAGGTGACTGTGTCACGGAAAAAAAGATCCCTGGACGAAGCACTAAC 
AAGGATGTTGGTGAAGAAACTAAAACAAGCGAGAAAACGGAGGGAGAAAGAAAGGGCTCTAAGACGGCAAAG<KAAAGACC^AGGAAATTGCTAGrTCGT 
TGAGTAAATGTGGGAAGAAAGATGCGAGAGATGTCATTCTTGACCGTTTACTAAAAGCAACACATTCTTCrTGCACCAACAATGAAGAGAGAACCAGAGT 
CTTACAACAATATAGCAATTGTACATTATCTTCCTATATAACTTCAGTCATGAAATTGGACCAAAGAGT^ 

TTGGATCAAATACGTAATCTCTCCAACAAGAAGAGGCAAGAAAAGGGAGGGCCTTrTAAGTCTGAATTGGACGCCATGGTTGCTGCAGTTAAGGTTAAGT 

TTTTCCCAGTTTTAGATGCGTCTAGAAAATTGACTCAAGACCATTGGAAAAAGTGCCCCGTGTCCATCCCAGAAACGCGTGAAGAAAAACCATrAATGGG 

TGTGCCTTTTGAAGTTGCACTCAATTCTCTAATAGGAAAACACAAGTGCACAGATACATGCGACATGGCTTGTTGTCAATCATTGTATTTTGTCCTCTTG 

TACACGCTAGCTTTAAAATTTGAGAACGAAAGATTGGCCCGGCAAATTGG^CTAGATGACTCTGTAGATTTGATGGCTGAGATGTTGTTCGGAGGGGATA 

AACTATTGGCCCAGGAAGTGTTAAAAAGGGTAAAAGATGCTCAAGATAGAAAGTTGGTGAAATCTTTATTGCCTTTAAATTATAACCATGACACAAATAC 

AATTATATTTTTGTTTGAGTCTTTAAGGTTTGCTCAGAAACCTGTAGCTGGTATGAGTGTTAGTGAAATAAAAGACGCTGTTAGAGGTCTGGCCTTTTCT 

ACCACTACAGGTACTGTGTGGAATTATACTGATGAAAGATTTTTTGGACCATTGTATAACATGGATGAACTTTGTAACGAACGTGTCAATGGAAATTGTA 

AATTGTCCTTTATAACTGGTATTTATCATACGGCAGCAGTAGAATTGGCTGCTGCATGTCTATCTTGTGTTTTGTAAGAAATGATAATAAAAAACATGTA 

TAATTTCATTCTGGTTTTTATTTTATATCCTCAAGAGTTGTAGTAGTAGCAGCAGCAGCAGCAGCAATAGCAGCATTATTTTCAGCGACAGCTTCTTCAT 

TTCTATTGATTGATATCTGGTCATTTTCTACAGCCCATGGTCGTTTAAGGAATATTCTTTGAAGCAGAAACACATACCCAGCAATGATTGCAATGATCAA 

TATTAATAGTAGCGAATTAATTAAAATAAGCATTTTGTCCACATTCTGGCTAAAATAAAAAAGTTTATCTCTCAATGAAAAATTTTTATTGACCAATCAC 

TTCGGGAAGTTTTGCAGCATGACACACTATTACTCCAAAACTATGCTCACTCGTATCTAGTCCCACAACATCACGCGTCATATTTAAAACATGCACCTTT 

AAAGCCTGTTTTTTCTTATACAATTCTGTATTGATTATAGGGTTTACAACCTTTAGGCCTTCCTTTAGAGATACTATTTTCATGTATTTTACAATCGAAT 

ATGAGTTGTCTTGTGCACCAGACGGGAGCCTGTCATGAAACTCTACTTCTCTCCCACGTGTATTGAGCGAGAGATCATTAAGCACAACTTCTGAAGAAAC 

GAGAGGTTGTAAAATAGAGTTATATTTAGGTTTAATTAGGAAACATTTX3TGGAAACTTGAACTGTTAATATCAATAGAAACTGAGCTATATGCT 

ATGGCTTGCATTTTCAAATCATGTTCAAATCTTACTACATAACCCAAGAAATTATAATCTGCTTCTTCACCATCATCTTCTTCCATGGCATCCTCTCCTC 

TTGGCTCTTTTAGTACAATTTCTTTTGGTTGGTTAGTCATGTTGTATATGTTCATTGAGCTATGATTTCCC^CAACATCATCAAAGAAAACCCCTTCTGG 

GATAATGCATGCACCGACAGAGGTGTAGAATAATCCTCCATATCTAATGAC^CCGACTGCGGTAGCGTTCTCTTCATCATAGTAACCGAGCTTGATTTTC 

ACATGACTCATGGGTCCGATAGAGATATTGGAAGCCGGTTCAGTAGCGGAAAAAATAGACACTTGTTGGAACATTTTGTCGTCAGGAAGAGTCTCGAATA 

AAATACCGACAGGGGAATTATCTTCAACACTATTAAAGTTTGTTGCATAAGATAGCCGCCCTGAACTACTTGTATTATAAATTGTTATATCGGATGCTAT 

TTCAGAACAAAGATCTCCTATAAAATTGTGATTAGGGCAGACCAACATCCCXTCTCCAGTGTAAGACGATAC^ 

ATGGCAGCCGTTCGTGAAGGATTTCTCACGTCAACTCGTCTCACCTTCACTGGTACTTTTCCAAATCGTCGTGGATCTACATTGAAACTAAAATCTCTTT 

CGTGGGGATCACTrCCATCCTTTAAAGAGAAGAGAGGCACTCTGTAAACAGGAGACCAATCTGAATAGTACCTTAAAGCTGTTCTATTTTTTTCTTCTAG 

AGTGGGTACCGCTCTGCGGTGCATTCGGTACACCTCCAAATCCGTTTCCTTTACTTGAATCGATTTTGCTCCTTCATTTTCCCAGAGCGCATCAGAATCG 

TACCTGTCGTCGAACGAATCTTCACCTCCCATTTTATTTATTTTCCTCTATATGATAAAAACCACACCTAAAATGTCTAATATTTTACTACACTGAAAAA 

ATGGCGTACATTGACCAAGGGGCGTTTGGAGCCAAATCTGTGATGCAGCAACAATACCTTCTTCCACCCTCTAATAGGCTCTCCAAACAACAACCTCCAC 

TGGCATCATCTTCTCTTCAACCTTCCTCTTCTAATAAACCTAGGAGTACATCAAGAGTAACAGACATATTTGTTGTGATAACTTGTGTTGTTTTAAT 

TGCTTTTATAAGCAACTCATTTAGTGTTACAAAAAATGTGGTAAAACTTTCTAAAGAACAAACCGAGAAAATACTAGAAAAAGATTTGCCTGATAAGGTG 

TACAAATTGTTTGAAAATTTAAAGGATGGAACATTTGGTATAGGGGAAGATGAGGAGGAAGAAAGGGAGGAGAGGGAGGAAGGGGAGGAAGAGCTTGAAA 

CAAAAAACATAAGACTTAAAAGAAAGTGGAAAGAAATGACTGAACAAGGGGATCAAGGAATAAAAGTAAGGAAATTACATGGCCCGAGAGGAGAAAGAGG 

AGAAACTGGTCCAGCAGGAGCAGTTGGCCCTGCAGGCCCTCAAGGAGAAAGAGGAGCAATTGGACCGGCAGGAAAGGATGGAGCAGTTGGCCCTGCAGGC 

CCTCAAGGAGAAAGAGGAGCAATTGGACCGGCAGGAAAGGATGGAGCAGTTGGCCCTCAAGGCCCTCCAGGAGAAAGAGGAGAAAATGGACGCCCAGGAA 

GAGATGGAGCAGTTGGCCCTCAAGGAGAAAGAGGAGCAATTGGACCGGCAGGAAAGGATGGAGCAGTTGGCCCTCAAGGAGAAAGAGGAGCAATTGGACC 

GGCAGGAAAGGATGGAGCAGTTGGCCCTGCAGGCCCTCAAGGAGAAAGAGGAGAAAATGGACGCCCAGGAAGAGATGGAGCAGTa'GGCCCTGCAGGCCCT 

CCAGGAGAAAGAGGAGCAATTGGACCGGCAGGAAGAGATGGAGCAGTTGGCCCTGCAGGCCCTCCAGGAGAAAGAGGAGCAACAGGTATACCAGG AAGGG 

ATGGCGTGGACGGTTCTGTGGGCCCTCAAGGAGAAAGAGGAGAAATTGGACGCCCAGGAAGAGATGGAGCAGTTGGCCCTGCAGGCCCTCAAGGAAGAAG 

AGGAGCAACAGGACGCGCAGGAAAGGATGGTGCAGTTGGTCCTGCAGGCCCTCAAGGAGAAAAAGGAGAAGCTGGTAAGGACGGTTCTATAGGGCCTCAA 

G^AATACAAGGCCCAAGAGGAGAGACTGGACCACCGGGAAGGGACGGCACTGCAGCAGAAAGAGGAGAAAGAGGCTTCCCAGGACCACCAGGCCA 

GACCACCAGGAAAGGATGGTGTGGATGGTTCTGAGGGCCCTCAAGGGAAAAGAGGAGAAACAGGACCCGTTGGACCTAGGGGTGAACCAGGTCTAGCTGG 

CCTCCCAGGAAGAGATGGAGCAATTGGCCCTGCAGGCCCTCCAGGAGAAAGAGG 

CCCCAAGGAAGAAGAGGAGCAACAGGCCGCGCAGGAAAGGATG<MGC^GTTGGCCCTGCAGGCC*CTCCAGGAGAAAGAGGAGCAACAGGTATACCAGGAA 

GGGATGGTGTGGACGGTTCTGTGGGCCCTCCAGGAGAAAGAGGAGAAACTGGACCAGCAGGAAGGGACGGTTCAGTTGGCCCTGCTGGCCCTCATGGAGA 

AAGAGGAGAAAATGGACGCCCAGGAAGAGATGGGGCAACTGGCCCTATAGGTCCTGCTGGTCCTCAAGGAGAAAAAGGAGAAAATGGACGCCCAGGAAGA 

GATGGAGCAACTGGCCCTATAGGCCCTAGAGGAGAAACTGGTGCAATGGGAAAGAATGGCGTGGACGGTTCTATGGGTCCTCAAGGAAGAAGAGGAGCAA 

CAGGCCGCGCAGGAAAGGATGGGGCAGTTGGCCCTGCTGGCCCTCCAGGAGAAAGAGGAGAAACTGGACCAGCAGGAAGGGACGGTTCAGTTGGCCCTGC 

TGGCCCTW\AGGAGAAACAGGATTAACTGGCAGCCCAGGAAGAGATGGAGCAACTGGCCCTATAGGTCCTGCTGGCCCTCAAGGAGAAAAAG^ 

GGACGCCCAGGAAGAGATGGAGCAACTGGCCCTATAGGTCCTGCTGGCCCTCAAGGAGAAAAAGGAGAAAATGGACGCCCAGGAAGAGATGGAGCAACTG 

GCCCTATAGGTCCTGCTGGCCCTCAAGGAGAAACAGGATTAACTGGACGCCCAGGAAGAGATGGAGCAACTGGCCCTATAGGTCCTAGAGGAGAAACTGG 

TGCAATGGG AAAG AATGGT GTGG A CGGTTC T ACGGGTCC T C AAGG AAGAAGAGG AGC AAC AGGCC GC GC AGGAAAGG ATGG AGC AGTTGGCCCTGCTGGC 

CCTCCAGGAGAAAGAGGAGAAAATGGACGCCCAGGAAGAGATGGAGCAACTGGCCCTATAGGTCCTGCTGGCCCTCAAGGAGAAACAGGATTAGCTGGGC 

TGCCAGGAAGAGATGGAGCAATTGGTCCTCAAGGAGAAAAGGGAGAAAATGGACGCCCAGGAAAGGATGGGGCAACTGGCCCTATGGGTCCTCCAGGAGA 

AAGGGGAGAGACTGGTCCTATAGGTCCTGCTGGCCCTCAAGG AGC AACTGGTCTTCC AGG AAGGG ATGGTGTGGATGGTTCTGTTGGCCCTCAAGGAAAA 

AGAGGATTAATAGGGCGCACAGGAAGGGATGGGGCAATTGGCCCTGTAGGTCCTGCAGGCCCTAAAGGAGAAACAGGATTAGCTGGCCTGCCAGGGATAG 

ATGGAAAGGACGGTTCCGTGGGTCCTCAAGGAGCAATTGGACCTATAGGCCCACGAGGAGAAAGAGGAGAAACTGGACGACCAGGAAGGGACGGTGAGGA 

TGGT TCCACAGGC C CT AT GGGC C C C C AAGG ACTAAGAGG AGC T ACGGGAGCTCC AGG AC CGC AAGGAGAAAGAGG ATT AAAGGG ACGGC C AGGAAAAGAT 

GGTGAAACAGGTCCTCCAGGGCGACAAGGAAGGGATGGAATAATGGGTCCTAGGGGTCTTCGAGGAGAAAAAGGAGCACCTGGTAATGATGGTCTAGAGG 

GACCTGAAGGAAGAGATGGTGCACCTGGTCCCGCTGGCCCTATTGGACCTCAAGGAATAAGAGGATTAAAAGGTATCCAGGGACGACCAGGAAGAGACGG 

AGAAATGGGACCAGCCGGCAAGGACGGAATAGAAGGCCCTAGAGGTCAAGATGGAACAACTGGCGCTAAAGGACCTAGAGGATTAAGAGGTTT^ 

AGAACAGGAGAAACTGGTGCACAAGGATCTAGAGGAGAAAAAGGCGATAGAGGGCTAACAGGCCCTCAAGGAAGAGACGGTCCACCCGGTGAAGAAGGTC 

CTCAAGGTCTTAGAGGAGAAAGGGGAGCACCTGGCCCTAGAGGTCCTAGAGGTATTCGTGGCCGTTCAGGACCTCAAGGAAGTAACGGCGTGC^AGGACC 

TCGAGGTCCCCGAGGAACAAAAGGAAGAACAGGAATACAAGGCCTCACTGGCATAGAAGGTCCTCGAGGTCCTAGAGGTATACAAGGAAAGGAAGGAAGA 

ATGGGGAAAATTGGACATCGAGGAGAAAAGGGTGATAAAGGAGACCGTGGAGAACAAGGCATCGCTGGAGCAGACGGGGAAAAAGGTCCAAGAGGTTTAC 
GAGGAATTCGAGGCCCTATTGGTGCTCCTGGTAAGCCTGGCACGGAAGGGGTT^^
AGGGGAATTAGGTCCCCAAGGACCAACAGGTCCTCAAGGGC^ 

GGAGCAGTGGGACCAAGAGGAGAGAAAGGAGGTAGAGGAAGAAAGGGAAAAAATGGCCCTAAAGGAGOSGAC^ 

AATAT TCAATCACCC ATGCTC G TGC AG AG ATAATGTGGGAAGGAAATGAAATCGG AGAAGCAT ACATTGGAAGATCTTAT GG AAC TG ATAC AATC CC TGT 
GATGATAGAAAATAGAATAGGGATGACAAATGAGGACAAAAAAAACGAATATTGTATACAAGTAATGACAATGCACTCAATAACAACTAGAG 
TCGGGTGTTTTTGTGGTAAGCAATAAGACAGATTATATCCTTTTAGTTACTTTACTGATGCCAGAAAGTGTTTCCTGTAGAACAGATGTCAG 
CG AGGTCAGAGAGGGTG AATGC TGTTAGAGAAAG AGAAAGCAAAT CGTACAGATT TATTAGGCCGT^ 

AATTGCCGTGGTAATGTATCCAGACGCAAGCATGAGTTACTCAGTTGATACATTAGACGCTGATGTGGCGCGAAGAGAAACAACGTCTGTGCTTTTATTA 
GCAGAAACCATACACGGGGAAAAAGATAGAGGTTTCTATGCTGATAGAGGAACTGTAGGGAGG7TGATGGTACCTCCCACTGAAGAAGAGTTATTGGTAT 
TGCAAGC 
CT1035
Nucleotide 
Genomic coordinates: 
Start: 130589 

Stop: 131444 (SEQ ID NO: 2) 
Amino Acid 

MAAAKMDAILADINGNDTDLSKLITDVIQKRAKAVMDRNRAKMDMNRRVDEAIQEAVAAK 
KQKALVVFDKLVEETDSGQSVPPTLSGSDYDAWVDRAMPSHIELVESVEGDSLYDKLPPF 
NVQDIDDQIGDEIDTPISYLAMVVVKVDCETGDIEEEYNLAPTFGVTQNNKIYRDERDQI 
FTKADKSVRIFKLAKLDSISGKSRQLTYAVKNNNEYTEFVCSVFAEFESDSDTTKSGIGI 
REYDKPKNEFEYEEREIFTFFIPIQPAGTKLLLYFLVDVRSRII 
(SEQ ID NO: 3) 

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

Q65326 (Q65326) DNA POLYMERASE (FRAGMENT) 36 0.31
MYSB_CAEEL (P02566) MYOSIN HEAVY CHAIN B (MHC B) 34 1.6
O02244 (O02244) UNC-54 PROTEIN 34 1.6
O04327 (O04327) CELL DIVISION PROTEIN FTSH ISOLOG 33 2.1
Q04009 (Q04009) MYOSIN HEAVY CHAIN 33 2.1
O02077 (O02077) COSMID C48E7 33 2.7

Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 1 to 765 of CT1035: this
corresponds to nucleotides 130649 to 131413 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=350
5'stop=373
3'start=405
3'stop=427
5'primer=TCCCTCCTTTTAACGTACAAGACA (residues 350 to 373 of SEQ ID NO: 2)
Tm5=57.95
3'primer=CCATGGCAAGGTAAGATATTGGT (residues 405 to 427 of SEQ ID NO: 2)
Tm3=57.97
probe1=CGGTGATGAGATAGATAC (residues 387 to 404 of SEQ ID NO: 2)
probe1start=387
probe1Stop=404
direction1=Forward
Tm1=69.08
score1=1.91
length=78
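
The `length` value in each TaqMan set appears to be the inclusive span from the first base of the 5' primer to the last base of the 3' primer; for CT1035, 427 - 350 + 1 = 78. The Python sketch below checks that reading and also confirms that each primer has as many bases as the residue range it is mapped to. The function names are illustrative and do not come from the listing, and the inclusive-coordinate convention is an assumption inferred from the CT1035 values.

```python
def amplicon_length(five_start: int, three_stop: int) -> int:
    """Inclusive span from the 5' primer start to the 3' primer stop.

    Assumption: both coordinates are 1-based and inclusive.
    """
    return three_stop - five_start + 1


def primer_fits_span(primer: str, start: int, stop: int) -> bool:
    """True if the primer has exactly as many bases as its residue range."""
    return len(primer) == stop - start + 1


# Values copied from the CT1035 TaqMan set.
ct1035_length = amplicon_length(350, 427)
ct1035_5p_ok = primer_fits_span("TCCCTCCTTTTAACGTACAAGACA", 350, 373)
ct1035_3p_ok = primer_fits_span("CCATGGCAAGGTAAGATATTGGT", 405, 427)
```

A mismatch between a primer's length and its stated residue range would point to a transcription error in either the sequence or the coordinates.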
CT1037

Nucleotide 

Genomic coordinates: 

Start: 131480 

Stop: 132941 (SEQ ID NO: 4)



Amino Acid 

MFVISIATSLVLFFFLLFVSITILDGAKTIDSQPFRKRRKRKRYRTSESGDGIDGGTGTT
NGGGGGGGEGGGGGTNGNGTGTTNGGGGGGEGGGGGTNGNGSGTTNGGGGGGEGGGGGTN 
GGGNGNGGGNGNGNGNGGDTDTDDFEPTPALLKERLLNSISSKPKEYYEAFVSAEVETAL 
QLSRDDSTQTIIIDDDQLELDASDTLQGKPRDYLFKLAGVSSAFLEGTTIRKAEDRARNI
NEEEIAQTILSQLREKHINDEYDGKYATPEERADFSNSLNLVTKYTNHEVGLLVGETIEK 
AFPHEIEFERCIILVEDFNSGTITSNTMQYRSNAYKIRVVEGSTTDPGEVVPDDCLVFAV 
VWKEQHSLEISATNRCQDICFVIIPRLSAIGKNATMVIRKGDEIKQETYLFVANKNDTT 
HFSIITDKDESVGIELNMLIFSERILPTLSDPATVPRPLTDANVLSAYGKRLGVGAFTDK
NLLSSQ 

(SEQ ID NO: 5) 
Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

O65514 (O65514) PUTATIVE GLYCINE-RICH CELL WALL PROTEIN 120 3e-26
Q21835 (Q21835) R08B4.1 PROTEIN 112 6e-24
Q43522 (Q43522) TFM5 GENE 110 2e-23
O53553 (O53553) PGRS-FAMILY PROTEIN 108 1e-22
O65450 (O65450) GLYCINE-RICH PROTEIN 107 2e-22
GRP1_PHAVU (P10495) GLYCINE-RICH CELL WALL STRUCTURAL PROTE 106 3e-22
Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 392 to 1365 of CT1037: this
corresponds to nucleotides 131892 to 132865 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=723
5'stop=745
3'start=792
3'stop=811
5'primer=TGAGGAAGAAATTGCACAAACAA (residues 723 to 745 of SEQ ID NO: 4)
Tm5=58.39
3'primer=CCTCGGGTGTGGCATATTTT (residues 792 to 811 of SEQ ID NO: 4)
Tm3=58.37
probe1=CAACGATGAATACGATGG (residues 774 to 791 of SEQ ID NO: 4)
probe1start=774
probe1Stop=791
direction1=Reverse
Tm1=68.98
score1=1.98
length=89
CT1038

Nucleotide 

Genomic coordinates:

Start: 132993

Stop: 133896 (SEQ ID NO: 6)
Amino Acid 

MGDKQKVEQLLRELKAEANDDWLSANVDPIVERFVTTKSDETAQVVKQAVDEKYDELLED 
KVEEMRPDIINEASETYDKLAADMIREVDTSSVIAPAIAGTVARTINNLRDKRKEYEKRL 
WTIAYKPWRRYVQAITVMEFRLSYKDLTVHANSDTYLTFPFLRIKKIAYINNDRASPVNC 
SLSVSYPNKSEWGNDNGVGRKVDIHIRRNDLQEKDLYLSVICMLDTDFSGYDKAVEVDAH 
KFHFEAGNRTMFLPKTSNLFNRSHIVNSKICTIVFPPASASSASTTELDNVYYRITCTCS 
(SEQ ID NO: 7) 

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

RESA_PLAFP (Q26005) RING-INFECTED ERYTHROCYTE SURFACE ANTIG 38 0.12
Q9X9G6 (Q9X9G6) HYPOTHETICAL 35.5 KD PROTEIN 37 0.15
CAA21397 (CAA21397) ORF74 37 0.15
Q9ZFZ9 (Q9ZFZ9) HYPOTHETICAL 21.1 KD PROTEIN 37 0.15
RESA_PLAFF (P13830) RING-INFECTED ERYTHROCYTE SURFACE ANTIG 36 0.26
Q9ZH03 (Q9ZH03) LAMBDA HOST SPECIFICITY PROTEIN J 36 0.45
CT1039

Nucleotide 

Genomic coordinates: 

Start: 133968 

Stop: 136344 (SEQ ID NO: 8) 
Amino Acid 

MEEESQRVQRRIGVLPEEAASQILKDTKLRVSYLGVGHWGYSVSVIKSALQKGCRRNDED 
ITAWSIREAYLYYHLGLNYIENVKPAAKSLNTNMVNRIKIIAVEDTSPRSMVASNECVRT 
LEKYEKGNFRQPSYLMDAAMRLVHASSSRVCSHMRALCCKEEDSDKLGGIYYANFNELET
QCVSAVNFSPIERIKHVFREIESVKLGKKSVQLLNLRSVAAYHVLRYYGDKVKDTNKKHS 
GPFKRKEFEQFWGLCFKFVTQHVKTDPELRCYFNELTYAINWRRDFFCSKGFFREESLFL 
TSIVELIIAMCIGDRKQFAKIQKRDLKRFNKGEEGRKEEAATFDWIEGHVKRMPQMPVWV
LDKHTNKNTHGVSFALESSMVSGGDKRWSPGVWLHSYTKMRLDSPPPPEVGQFLDQAFNT 
LKREAASHCVTRNICTTTGFIKASSFTANINSEPMEIKEEIKKRKIEIKDDNTTATVTVS 
ATTSSSITSTPPPTKKQKTTPSGSNKVDSIQLNNLPTLNMEDLDRVLEVHNQNSKKGVAA 
TVLMKDGNKVVFKEMRKSFGWGSHQNFVQVLKDEDVCKLDYLLPCPDSGPYRGLYRCYFK 
IVKDEISSTAARIEKVKWGENAMCYFISGCVTRQEGIGKIITDVRLSHMGPNKQYVYDNY
RQLIHILIFRLLTGVSDTNTSNILVGDGGNLFSVDENYVGAKDPRTALENRKIKELQLLL
KTSFKVNKVTKEDIDSCLPSWLFDTSKSDKIMNGVCNIGKNMGIGPTTLDIVKNNCTCIL 
GVVNDLLYDNK 
(SEQ ID NO: 9) 

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

Q9Y1H4 (Q9Y1H4) GAG 43 0.011
RAP1_YEAST (P11938) DNA-BINDING PROTEIN RAP1 (SBF-E) (REPRE 41 0.031
SLY1_YEAST (P22213) SLY1 PROTEIN 40 0.070
P90603 (P90603) MUC.CL-l 38 0.20
O52224 (O52224) GLUCOSYLTRANSFERASE (EC 2.4.1.5) (DEXTRANSU 38 0.35
Q20202 (Q20202) F40E10.5 PROTEIN 38 0.35
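
Each hit line in these tables carries an entry name, an accession in parentheses, a description truncated by the report, a bit score and an E value. The following Python sketch shows one way such a line could be split into fields. The `BlastHit` type, the `parse_hit` helper and the regular expression are hypothetical, written for this listing's layout rather than taken from any BLAST tool, and they assume the score and E value are the last two whitespace-separated tokens.

```python
import re
from typing import NamedTuple, Optional


class BlastHit(NamedTuple):
    entry: str
    accession: str
    description: str
    bits: int
    evalue: float


# Entry name, "(accession)", free-text description, integer bit score, E value.
_HIT = re.compile(r"^(\S+)\s+\((\w+)\)\s+(.*?)\s+(\d+)\s+(\S+)$")


def parse_hit(line: str) -> Optional[BlastHit]:
    """Return the parsed hit, or None if the line is not a well-formed hit row."""
    m = _HIT.match(line.strip())
    if m is None:
        return None
    entry, accession, description, bits, evalue = m.groups()
    try:
        return BlastHit(entry, accession, description, int(bits), float(evalue))
    except ValueError:
        # e.g. an E value damaged in transcription
        return None


first_hit = parse_hit("Q9Y1H4 (Q9Y1H4) GAG 43 0.011")
```

Rows whose E value was damaged in transcription make `parse_hit` return `None`, so they can be flagged for manual review instead of being silently misread.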
Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 1 to 1092 of CT1039: this
corresponds to nucleotides 133866 to 134957 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=1255
5'stop=1277
3'start=1353
3'stop=1375
5'primer=AATACATTGAAGCGAGAAGCTGC (residues 1255 to 1277 of SEQ ID NO: 8)
Tm5=58.71
3'primer=CCTTAATTTCCATCGGTTCAGAA (residues 1353 to 1375 of SEQ ID NO: 8)
Tm3=58.18
probe1=TAGTCATTGCGTAACGAG (residues 1278 to 1295 of SEQ ID NO: 8)
probe1start=1278
probe1Stop=1295
direction1=Reverse
Tm1=69.03
score1=1.96
length=121
CT630

Nucleotide 

Genomic coordinates: 

Start: 249215 

Stop: 247358 (SEQ ID NO: 10) 
Amino Acid 

MALKDAFTERLVVNKVGSGTCWAPVVEDDRQKSLFQKVENLYRVLVVEQKNSAITLSGNK 
NTNKRQCRQVEEDKVIFEGEDRTVSNLPQAVKETIAANAESILDYWYKNVIPLLDTKKER 
SGKSDTFLRTAVICLVRCCVSYKDMKTCSLIYEFEHKILNKSTLDPLLKDILDNKQELLH 
MDSKYGSKTTSPELAKETIEALYTTVYNHWTNAFKLYQASLTHKPVTGKKYASVIHFIRT 
WRKIVKAYVSKHNNVERDLSLKNIMKNESADNANVLTIEKMYKKIGNSVKNTNNNSAHQM 
SDSEDDDDDDDDDCEGMDVCDEASEREKKHQESLYPINTPVTTITGDYIFKVLLELVLSP 
HIHPEWKIPMCDFVNRNIPKLMKAMETDISNAVIEVRASKVNPVQILPIAANFWDFCKSG 
KPPSDVKFCMMFNEPSSNETLSSGAGVFGRFIGGPFSHKSKELDIISNCLRSLLLNKEAD 
NLSTRIWREGGSVVCFNYCPITARGAVLGYGEQLSERSIKALWAKKIQDAVTESVKRQRN
AADKNSRNCDLLGDEGVVSMKTVTFGCANMLKTQNGMGKFNVWSFEDSIQANKEGAARQ 
YMSQQVFTHSFPALDQGK 
(SEQ ID NO: 11) 

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

O97318 (O97318) PFC0865W PROTEIN 45 0.002
YB00_YEAST (P38114) PUTATIVE 126.9 KD TRANSCRIPTIONAL REGUL 44 0.003
Q08281 (Q08281) CHROMOSOME XV READING FRAME ORF YOL138C 42 0.010
Q92271 (Q92271) 12.8 KBP FRAGMENT OF THE LEFT ARM OF CHROMO 42 0.010
O62235 (O62235) F36F2.3 PROTEIN 41 0.018
Q83970 (Q83970) (CPV) 41 0.023
CT631

Nucleotide 

Genomic coordinates: 
Start: 264975
Stop: 259164 (SEQ ID NO: 12) 

Amino Acid 

MGSKRPCSSGQEPVTKKQKKNNNNNSNPVPVINIKSYPFLATRTQVLRSAVAAAAASPSG 
SSSSSSSSASAVKLPDTCKEARKVLSTVSLQQSLAVRYLCNSISVSYAGGGISVFHLGGL 
PGAGKTTMVKELIAVLNDHGLIDSGSADMLLCCKSNSAKESLMCACKKPGGSSLMYPESV 
FSTLNKGFEIPVIFRKDEITLEKIQFVADKLKWKVIQVLANLRFLVIDEYTMASCRELVF 
IDAVLRIAKHRPDIPFGGVFVILLGDNRQNSAVVEDNTNHIQKKIKNPSEEEKPQKNNKN 
NKNKKKKKEKKEKGGEEEEGDENEEEEGEEEEEEEESDDEAETKKEEEKSTFFQGSVEQD 
NFGQEDNAKLYTEVFIKILKMFCSRDFFGNPSNLRNIVNKRHEAILMKSNNVKSVNNNLV 
SSAIKVEDCGNASNKKEVTAPSSSPAQSTAEENCDEFDDEEDDDFFNNEAFLKLMERNAL 
EKDRASGALNGFSLRCKSISDANEKIRSGTTSVSDKKSSLDMMKSLPLSALIEEGICSEL 
AHISELKKMSNANLEKYTENVCSIVFDMMAKAMREIDYSGREKLYIVSSLSERFKDTHLT 
SLMDEEILNVKYVHGSDPKCIDAVPFNSAHNRASAVAACVRNAFFRDGKDFVDETPIANY 
FKDNLRTVASFLENETLTYKELLAKSENIRSILLKKETGNNNSASSRTAAAAAYYEDDED 
YCYFDEEEAMDLEDGGSGGSGMKSSGGGGDDDDEESGEMIYRTDIPDKLHRDASTLDRVG 
HLVDFHVVWKKWLTENKPSDLVRARVWYLYTLVRMQQVKFDNGKLPSLDKSALSGRLFHS
PSEWATSTGVGVGGGGGAGADKPLHDEYWLRVLSMPISTGGDVGKSMLLPAYSSYLSALS 
RTYIMSSLKRIDIIKHAYSLMYGISLFDMTANLQDLVDTRMAGRSSRNGSVFMDNFDPVQ 
YFDNIFPSMVNEFLMYRKEDVFNNGQMMEGVKGSLKISRVLQTAHTENNNINNRHNNSLK 
YSEKSIVLAMQMVTSISKGNERRKKIEEFITKEQGQPKDMCERLMANSKAKQEKDAISSK 
TDKMMGAITLTKKHVLKNAVSNLVDTSIIKETKKNNNNSSSSSSTSLAAAAAVENSVPAL
RVEVKFVVLNMDLSDISHEKTISHKYRQQLINAIKTRSTPLFDKFTDRKILRAAESPRAL 
TTILLDEKKKVTRAKSITLYQGQNVIFTTSNRMIHGTQERFVTKDTGVVTNIiMYKNGELT
VFVYVERLGQKCLEIKEGRQIIGNPNIKNGGFGNNVYVQYLPFESSQAMTIYSCQGHTFF 
RDTIVDLSGASTQDAYVAVTRNSNPQNLFIIQNHSVERGNLCNIKCAMSKDKAYTMPIGG 
IADFNGSDFINHDTVSVSREVAESSAAMDDDYNGDGGVTMYSAYDPSKDVVAAAEEFILS 
RSGKSLSFNASWMANTAKVIQQHGLETELKNIRDFFFGVNNGDVAKHYEKLCNKKMIELY 
TAIVRSITHYSIASGIVKQPSSKLCEEYETKQKNKKDYIKIHPVFVNRAPKESTIEMLLF 
DIAPHNKATIVFQFYVHYIFLVYEKLNVLNSSFAFLPSPNFCLNQYVRPKSITTNSTHVP 
NLGYESKDFAHCKDGGERDVKLRLPITSADEFSNNIEGILKKVSDTSNQNKVNKYMDVVC 
KSMQHNLRRTGKFCRPTETCGLSKHGSIVTSTCTAQEKGENIHVDAEKGWLCMSDEANVY 
CMLMFMSKIA7VASGVSEFPIKDKSILESNPETPSDTISLLAPRKTISPTNNLHFSMSEDV 
LFCGQVHPMKRVQFSLHVKRTGGALKSTFEEEEGLPTKIFSPNFATYPLFKKCKMYGAII 
IAMTEMQGHEFAKYSTLDIRKSMFTGVGTVVDLEKISGEGNEVMDKVDKFIVKNVSNILF 
KEQGKRVSFFVSCAIH
(SEQ ID NO: 13) 

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

Q9YTL7 (Q9YTL7) ORF 48 68 5e-10
O35788 (O35788) CYCLIC NUCLEOTIDE-GATED CHANNEL BETA SUBUNI 68 5e-10
NAB3_YEAST (P38996) NUCLEAR POLYADENYLATED RNA-BINDING PROT 68 8e-10
Q07034 (Q07034) RNA BINDING PROTEIN 68 8e-10
O96229 (O96229) HYPOTHETICAL 78.6 KD PROTEIN 66 2e-09
O96134 (O96134) SER/THR PROTEIN KINASE 66 3e-09

Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 1 to 830 of CT631: this
corresponds to nucleotides 259304 to 260133 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=2408
5'stop=2429
3'start=2489
3'stop=2509
5'primer=GAGCCCGTGTGTGGTATTTGTA (residues 2408 to 2429 of SEQ ID NO: 12)
Tm5=58.49
3'primer=AGCGTCCAGACAGTGCAGATT (residues 2489 to 2509 of SEQ ID NO: 12)
Tm3=58.56
probe1=CACTAGTGAGAATGCAAC (residues 2432 to 2449 of SEQ ID NO: 12)
probe1start=2432
probe1Stop=2449
direction1=Forward
Tm1=68.99
score1=1.99
length=102
CT632

Nucleotide 

Genomic coordinates: 
Start: 268485 

Stop: 267717 (SEQ ID NO: 14) 
Amino Acid 

MAGVDLYGGHIKPYGETVFNNKMQGNRGKIRALINEKAAATLPMSEDNISAWVTEVAADV 
FPDPKSALTFFVPNKSLNAFAWDVLKTPASVEIDIGKRIPQLIENLHMSDFTVAIFRVKC 
DDQGRYETSYNLSPSMGGKINHGLIRTLAKAQDIWWKRDFSLTIENFEVDNGKKRLDFL 
FNNQTDKSCFVKIFHEMESEKDIAIKPEKRGSSAVWDEVYSDIVTKNTRNAKFSLRYRNE 
KPVDHLLLYCMVTYF 
(SEQ ID NO: 15) 

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

O97036 (O97036) PLC-BETAH2 34 1.5
O45329 (O45329) F09C6.2 PROTEIN 33 2.5
P97868 (P97868) PROLIFERATION POTENTIAL-RELATED PROTEIN 32 5.6
P70287 (P70287) RETINOBLASTOMA BINDING PROTEIN 6 (PACT) (FR 32 5.6
AAD49229 (AAD49229) EHEC FACTOR FOR ADHERENCE 31 7.4
CAB55629 (CAB55629) LYMPHOSTATIN 31 7.4

Comments:

TaqMan Primer/Probe Sets:

5'start=378
5'stop=403
3'start=452
3'stop=477
5'primer=TGAAACCAGCTACAATTTATCTCCTT (residues 378 to 403 of SEQ ID NO: 14)
Tm5=57.68
3'primer=CCTCTTCCAGACTACAATATCTTGGG (residues 452 to 477 of SEQ ID NO: 14)
Tm3=59.43
probe1=TCAGAACACTGGCTAAGG (residues 434 to 451 of SEQ ID NO: 14)
probe1start=434
probe1Stop=451
direction1=Forward
Tm1=69.15
score1=1.84
length=100
CT633

Nucleotide 

Genomic coordinates: 

Start: 266790 

Stop: 266442 (SEQ ID NO: 16) 
Amino Acid 

MASPLVASLGGGKNILFGLLLITIIVIVIAVIIIKAPLLASLLAGTALAGTIASALGSIP 
GVGGAFKKAFGKGKGKGGPKTPDGGAKKTNQKPKKGKKKPPTRRSIFKRIPKIKF. 
(SEQ ID NO: 17) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q02391 (Q02391) CYSTEINE-RICH FIBROBLAST GROWTH FACTOR RECE 36 0.092 

Q91019 (Q91019) MUTANT CYSTEINE-RICH FGF RECEPTOR 36 0.092 

Q69526 (Q69526) GLYCOPROTEIN B 36 0.12 

Q9XZ15 (Q9XZ15) HYPOTHETICAL 29.3 KD PROTEIN 36 0.12 

Q14113 (Q14113) AORTIC CARBOXYPEPTIDASE-LIKE PROTEIN ACLP ( 35 0.21 

P79922 (P79922) MODIFICATION METHYLASE (EC 2.1.1.73) (CYTOS 35 0.21 
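
The genomic Start and Stop values can be cross-checked against the translated sequence. For CT633 the numbers are consistent with a single uninterrupted reading frame: |266790 - 266442| = 348 = 3 × (115 + 1), that is, 115 residues plus one stop codon. The Python sketch below encodes that check. It rests on two assumptions of this sketch that the listing does not state: that Start greater than Stop marks a gene on the reverse strand, and that the coordinate difference counts the coding bases including the stop codon. The helper names are illustrative.

```python
def strand(start: int, stop: int) -> str:
    """'-' when Start exceeds Stop (assumed to mean reverse strand), else '+'."""
    return "-" if start > stop else "+"


def span_matches_protein(start: int, stop: int, n_residues: int) -> bool:
    """True if |stop - start| equals three bases per residue plus a stop codon.

    Assumption: the gene is a single exon, so the genomic span is all coding.
    """
    return abs(stop - start) == 3 * (n_residues + 1)


# CT633: Start 266790, Stop 266442, 115 residues in SEQ ID NO: 17.
ct633_strand = strand(266790, 266442)
ct633_consistent = span_matches_protein(266790, 266442, 115)
```

The same identity holds for CT1035 (Start 130589, Stop 131444, 284 residues), which suggests the check is useful for spotting residues lost in transcription, though a spliced gene would not be expected to satisfy it.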
CT634

Nucleotide 

Genomic coordinates: 

Start: 283360 

Stop: 282673 (SEQ ID NO: 18) 



Amino Acid 

MVSSRTSTTSSSAVAATSTLLPTKRKREPEEVKVKVEVKMEQEELVEDSSSNKRPRIKEE 

KEEEHKETHHLSLPCKEEEDDGEEEEYEEEEDEEEYEDRVDDDTAEKMENLLVQLDNTTK 

NIKLKNPLREHDMAVSHYEHEFEVQNTVNFSFGVLSDIGFLINREAVSRWGNTPPPKEFG 

DMEIGSLTVNQLLHKCDNFVQAVVQKVKEDITPSIEVTIDSLIDDPCW 

(SEQ ID NO: 19) 

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

O35788 (O35788) CYCLIC NUCLEOTIDE-GATED CHANNEL BETA SUBUNI 60 2e-08
GARP_PLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN PRECURSOR 57 8e-08
Q9YTL7 (Q9YTL7) ORF 48 56 2e-07
O94922 (O94922) KIAA0835 PROTEIN 55 3e-07
O08995 (O08995) MYELIN TRANSCRIPTION FACTOR 1 55 3e-07
Q93424 (Q93424) HYPOTHETICAL GLYCINE-RICH 37.0 KD PROTEIN E 55 5e-07



Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 575 to 1 of CT634: this
corresponds to nucleotides 282678 to 283252 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=210
5'stop=231
3'start=287
3'stop=308
5'primer=CCTCTCCCTCCCATGTAAAGAA (residues 210 to 231 of SEQ ID NO: 18)
Tm5=58.19
3'primer=TCGTCGTCCACTCTGTCTTCAT (residues 287 to 308 of SEQ ID NO: 18)
Tm3=58.40
probe1=AAGAGGAGGATGAGGAAG (residues 266 to 283 of SEQ ID NO: 18)
probe1start=266
probe1Stop=283
direction1=Reverse
Tm1=68.99
score1=1.98
length=99
CT635

Nucleotide 

Genomic coordinates: 

Start: 285773 

Stop: 284075 (SEQ ID NO: 20) 
Amino Acid 

MARSVGLLSVTPEYDTFKYIKMEEFKTLKVKNGFTISGENPDKYEHILLSFKSVDRVTKS 
ELRDGLYIVRLKDKEVLHIKNGVHRLRQLTGDNTLQVGLKYTHNLPRLGSLLQDDGCEDY 
GEKWNESLPIDMQNINKIVKEKALLSDKNFKFSPLYRLLHERLSNAAVKKCDYMIITTDF 
LVGCGYTPSHCPRTLRNMEQLLVEQCGFSSRISVYDICDRLTYKGAYIANPITGSYSNMC 
LIVPMDKLGLIFYNSTHPSAKSIGNYMSSLFNATVIYANERDNLQMDNFRREIKFAENEV 
NMKEEELKELRKRCAVSEEQRISLRDVHKKSSIATSRYDGGACLVFAFSDRDFSLLCRTN 
GNGSFYSATEEGIRYVSSPEYKKRDVGERRPRLIMSITGSDAPICIRDSVRNHFKTRLFS 
RTSGNSITFAVPPGERELMEMVREVTGTDIKIFMDNGKVYQNGAEINVIDPTSKEYKELL
KREENLPEDERKRLRRERRMIFNTSRAISMYNEERGDGGSGGETSEDGDGNGSTSSKGEK
RKREENEGNEYVVLLNKACKDIKVC
(SEQ ID NO: 21) 

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

Q21885 (Q21885) COSMID R09H3 37 0.32
O77336 (O77336) PFC0425W PROTEIN 37 0.42
O00164 (O00164) RIBOSOMAL RNA UPSTREAM BINDING TRANSCRIPTIO 36 0.71
UBF1_HUMAN (P17480) NUCLEOLAR TRANSCRIPTION FACTOR 1 (UPSTR 36 0.71
YPT2_CAEEL (P41880) HYPOTHETICAL 21.6 KD PROTEIN F37A4.2 IN 36 0.93
YG2K_YEAST (P53253) HYPOTHETICAL 106.5 KD PROTEIN IN CTT1-P 35 1.2

Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 501 to 1 of CT635: this
corresponds to nucleotides 284047 to 284547 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=686
5'stop=706
3'start=799
3'stop=822
5'primer=CAAACCCAATCACAGGCAGTT (residues 686 to 706 of SEQ ID NO: 20)
Tm5=58.55
3'primer=GGTTGCATTGAAAAGAGATGACAT (residues 799 to 822 of SEQ ID NO: 20)
Tm3=58.19
probe1=ACTCCAACATGTGCCTAA (residues 707 to 724 of SEQ ID NO: 20)
probe1start=707
probe1Stop=724
direction1=Forward
Tm1=68.89
score1=1.89
length=137
CT636

Nucleotide 

Genomic coordinates: 

Start: 286706 

Stop: 286076 (SEQ ID NO: 22)



Amino Acid 

MIVFVEGSPLTGKTSWVDNMRTAGKGKQSFLNFMYTNYRDYLPIFPWTIQEHLRASDYQE
RPRLVDGMFGSSLNFFTGMWRHDTEQFPESKIGLREYLEMYGEEFKACVAEWVKYKPVFH 
VMVYREEDVKKMEPIIQELNDAHNWFIDVLKEERALFVKIEVIPRNVYKGNICSSCFSTS
KNYVYRVGKCTNSIVHCDMKCKFIAEKII 
(SEQ ID NO: 23) 

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

P87968 (P87968) ENVELOPE GLYCOPROTEIN, V1-V5 REGION (FRAGME 33 2.0
P88315 (P88315) ENVELOPE GLYCOPROTEIN (FRAGMENT) 33 2.0
P88314 (P88314) ENVELOPE GLYCOPROTEIN (FRAGMENT) 32 2.7
DYHC_PARTE (Q27171) DYNEIN HEAVY CHAIN, CYTOSOLIC (DYHC) 32 4.6
O40068 (O40068) GP120 (FRAGMENT) 31 6.0
O40067 (O40067) GP120 (FRAGMENT) 31 6.0



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 594 of CT636: this
corresponds to nucleotides 286111 to 286704 of the genomic reference sequence.
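
Each EST comment pairs a transcript range with a genomic range, and the two ranges should cover the same number of nucleotides when counted inclusively (here 594 in both cases). The following is a minimal sketch of that consistency check; the helper names `span_length` and `spans_agree` are illustrative and do not come from the source.

```python
def span_length(first: int, last: int) -> int:
    """Inclusive length of a coordinate range, regardless of orientation."""
    return abs(last - first) + 1


def spans_agree(transcript: tuple, genomic: tuple) -> bool:
    """True when the transcript range and genomic range have equal length."""
    return span_length(*transcript) == span_length(*genomic)


# CT636: nucleotides 1 to 594 of the transcript, 286111 to 286704 of the genome.
print(span_length(1, 594), span_length(286111, 286704))
print(spans_agree((1, 594), (286111, 286704)))
```

A mismatch between the two lengths would point to a transcription error in one of the four coordinates rather than to a biological difference.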

TaqMan Primer/Probe Sets: 

5'start=239

5'stop=259

3'start=317

3'stop=335

5'primer=GGAGGCACGACACAGAACAGT (residues 239 to 259 of SEQ ID NO: 22)
Tm5=58.81

3'primer=CACTCGGCGACACATGCTT (residues 317 to 335 of SEQ ID NO: 22)
Tm3=59.42

probe1=TGGAGATGTATGGAGAAG (residues 293 to 310 of SEQ ID NO: 22)

probe1start=293

probe1Stop=310

direction1=Reverse

Tm1=68.95

score1=1.95

length=97
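
In the TaqMan sets listed here, the `length` field equals the inclusive span from `5'start` to `3'stop` (for CT636, 335 - 239 + 1 = 97). The sketch below checks this, assuming the record has been cleaned to one `key=value` pair per line; the function names `parse_taqman` and `amplicon_length` are hypothetical and not part of the source.

```python
def parse_taqman(block: str) -> dict:
    """Parse 'key=value' lines of a TaqMan record into a dict of strings."""
    record = {}
    for line in block.splitlines():
        key, sep, value = line.partition("=")
        if not sep:
            continue  # skip blank or malformed lines
        # Drop the trailing "(residues ... of SEQ ID NO: ...)" annotation.
        record[key.strip()] = value.split("(")[0].strip()
    return record


def amplicon_length(record: dict) -> int:
    """Inclusive span from the 5' primer start to the 3' primer stop."""
    return int(record["3'stop"]) - int(record["5'start"]) + 1


CT636_SET = """\
5'start=239
5'stop=259
3'start=317
3'stop=335
5'primer=GGAGGCACGACACAGAACAGT (residues 239 to 259 of SEQ ID NO: 22)
3'primer=CACTCGGCGACACATGCTT (residues 317 to 335 of SEQ ID NO: 22)
length=97
"""

record = parse_taqman(CT636_SET)
print(amplicon_length(record), record["length"])
```

The same arithmetic can be applied to any other set in this listing to catch digits that were lost or split during text extraction.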
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CT637 

Nucleotide 

Genomic coordinates: 
Start: 300432 

Stop: 299085(SEQ ID NO: 24) 
Amino Acid 

MGGEDSFDDRYDSDALWENEGAKSIQVKETDLEVYRMHRRAVPTLEEKNRTALRYYSDWS 
PVYRVPLFSLKDGSDPHERDFSFNVDPRRFGKVPVKVRRVDVRNPSRTAAIFVPTGPGLH 
VSSYTGDGMLVCPNHNFIGDLCSEIASDITIYNTSSSGRLSYATNFNSVEDNSPVGILFE 
TLPDDKMFQQVSIFSATEPASNISIGPMSHVKIKLGYYDEENATAVGVIRYGGLFYTSVG 
ACIIPEGVFFDDWGNHSSMNIYNMTNQPKEIVLKEPRGEDAMEEDDGEEADYNFLGYVV
RFEHDLKMQAMSSAYSSVSIDINSSSFHKCFLIKPKYNSILQPLVSSEVVLNDLSLNTRG
REVEFHDRLPSGAQDNSYSIVKYMKIVSLKEGLKWNPIINTELYKKKQALKVHVLNMTR 
DWGLDTSEHSFGVIVCHAAKLPEVIGQ 
(SEQ ID NO: 25) 

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

023592 (023592) CARBOXYL-TERMINAL PROTEINASE HOMOLOG 34 1.6
YGA1_YEAST (P53199) PUTATIVE 3 BETA-HYDROXYSTEROID DEHYDROG 34 2.1
VILI_DICDI (P36418) PROTOVILLIN (100 KD ACTIN-BINDING PROTE 33 4.8
Q9Z5B9 (Q9Z5B9) PUTATIVE TRANSFERASE 32 6.3
Q9YUY4 (Q9YUY4) ENVELOPE GLYCOPROTEIN 32 8.2

Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 3 to 1254 of CT637: this
corresponds to nucleotides 299132 to 300383 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=678

5'stop=702

3'start=788

3'stop=812

5'primer=CGGTGTCATTAGATATGGAGGATTA (residues 678 to 702 of SEQ ID NO: 24)
Tm5=57.26

3'primer=TCTTTTGGTTGGTTAGTCATGTTGT (residues 788 to 812 of SEQ ID NO: 24)
Tm3=57.75

probe1=CTACACCTCTGTCGGTGC (residues 705 to 722 of SEQ ID NO: 24)

probe1start=705

probe1Stop=722

direction1=Forward

Tm1=68.85

score1=1.85

length=135
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CT1040 

Nucleotide 

Genomic coordinates: 

Start: 137588 

Stop: 139940(SEQ ID NO: 26) 



Amino Acid 

MAAAAVSGEGRISADLLLLEQLTPDGDVIRYDSEQYTKPRKIFGDKSVIETIGHFLIHNH

NQGESYQIASSVLEKFPALLNCIWNGESGGMALWKALYRAKKYRLLNSLLVHKIKNWPSV 

AVIPIYGSVCDREERPIIMSEIIDKETLQTICKSDIRSLLGMMNAKHGTLGGNFLHFYAR

STKPFENFQYEAMGANAVLMAAEAIYDGFRDHGLNPSEYTFPGLESADVYGNNPVEIAIS

GDDDNMLLNLICNYGVSYEKTRGRVNRSLLDFLKMNTASKCLSVLKFVEKHFKIESNTPK 

GEFEEKAETCVNCLDRNNVLTKGSEQESYKLSCGHFLHVKCLRNICIVSQHLRCEKCLKR 

FDESILRKCTPNLNWWLTMPAGAGNEEEICFMRNKKLVDDFRKLLSPVSIPHFFKNSRQR

NLDMLCPYSDHTIIPNKEDPKKNEDGNRVRVNHTAISEKQNKEEEDARIKRVAVRTFTAI 

REKQNKEEEDARIKRAVDMAVAAINEKNKEEEDARIKRAVDMAVAAINENNKEEEDARIK 

RAVDMAVAAINEKNKEEEDARIKRAVDMAVAAINENNKEEEDARIKRAVDMAVAAINENN 

KEEEDARIKRAVDMAVAATNEKNKKEEDARIKRIIDLTVDMRIQRIVDMAIAAATKKDKK 

EEEKRTKREQELRADLRRAMDMVNEVQKKLEDMELEKGCNKDEAKNTSNVVSSSSVVAYS 

KEIVPCLGNNNNAVIGMTSTNYSANNTKNNVFGSPHKFSFNDASRFSNIVETPKMSFNFS 

FKT 

(SEQ ID NO: 27) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



Q9ZU69 (Q9ZU69) PUTATIVE VICILIN STORAGE PROTEIN (GLOBULIN- 71 4e-11

Q26775 (Q26775) TB-292 MEMBRANE ASSOCIATED PROTEIN 70 5e-11

INCE_CHICK (P53352) INNER CENTROMERE PROTEIN (INCENP) 67 5e-10 

YDF3_SCHPO (Q10475) PROBABLE EUKARYOTIC INITIATION FACTOR C 66 7e-10 

Q26774 (Q26774) TB-291 MEMBRANE ASSOCIATED PROTEIN 66 9e-10 

P91257 (P91257) SIMILAR TO C. ELEGANS UNC-89 65 2e-09 
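
Each row of a "Top Blast Hits" table ends with a bit score and an E value, and lower E values indicate hits that are less likely to arise by chance. The sketch below shows one way such rows could be parsed and filtered, assuming cleaned one-line rows; the regular expression, the function names and the 1e-5 cutoff are illustrative assumptions rather than anything stated in the source.

```python
import re

# entry, (accession), free-text description, bit score, E value
HIT_PATTERN = re.compile(
    r"^(?P<entry>\S+)\s+\((?P<accession>\S+)\)\s+(?P<description>.*?)"
    r"\s+(?P<bits>\d+)\s+(?P<evalue>\d*\.?\d+(?:e-?\d+)?)$"
)


def parse_hits(lines):
    """Return one dict per row that matches the hit-table layout."""
    hits = []
    for line in lines:
        match = HIT_PATTERN.match(line.strip())
        if match:
            hits.append({
                "entry": match["entry"],
                "accession": match["accession"],
                "description": match["description"],
                "bits": int(match["bits"]),
                "evalue": float(match["evalue"]),
            })
    return hits


def filter_hits(hits, max_evalue=1e-5):
    """Keep hits whose E value is at or below the (arbitrary) cutoff."""
    return [hit for hit in hits if hit["evalue"] <= max_evalue]


rows = [
    "Q9ZU69 (Q9ZU69) PUTATIVE VICILIN STORAGE PROTEIN (GLOBULIN- 71 4e-11",
    "INCE_CHICK (P53352) INNER CENTROMERE PROTEIN (INCENP) 67 5e-10",
    "Q21885 (Q21885) COSMID R09H3 37 0.32",
]
hits = parse_hits(rows)
print([hit["entry"] for hit in filter_hits(hits)])
```

Rows whose score and E value were displaced elsewhere on the page will not match the pattern and are skipped, so they need to be rejoined before parsing.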



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 853 to 415 of CT1040: this
corresponds to nucleotides 138912 to 139349 of the genomic reference sequence.
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CT1041 

Nucleotide 

Genomic coordinates: 

Start: 140110 

Stop: 141616(SEQ ID NO: 28) 
Amino Acid 

MVYKGFVSPSFITTRTMTSNRPTTSPLSFSEGFSLSGDKYDTYEDILLEQFNCFKTSSPS 

SARKSEIEDKTLIFQLKEGEKFHLAKGIEELREILDDNSATIEPIISPTTFNDRNELLNH 

EGDISSSPLYTQIMKHISPEHDIYELDLIVGTDLLFGLGVNLRNVSKLMKKISYGTLNVV

DVCHRKFFNNRIIVNPISSSFSKNVCIIPLFSAAEEFSSLGECRDLFNGICDDVERYINS

YFFYPENTTTTTTTAPSSPEMEIADEEEQSPKTIKRNDNASRNWSGVCLIFEVFKNTYYI

INRGDRGGSFEKAVKSAISSIKEKRCKITDINGNKPRLVMVITGCYTELYFKDALKQIGE 

NRRKFLKMNGNYFSLIDEQADLIEFAMSVSGAGERIFVNGLGMFQNRKMIPVIDPLTYEN

VVCGEHDIQKEDAILSVRRAIADYNDFVSKNKRGKKRSAEEENEDEDADASSSSSSSPPP 

SSPPAHKKSRLPDEGEKCTLC 

(SEQ ID NO: 29) 

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

Q26258 (Q26258) BR2.2=BALBIANI RING {5' REGION, REPEAT UNIT 38 0.12

Q23804 (Q23804) SPID PRECURSOR (FRAGMENT) 36 0.82

097324 (097324) MAL3P8.1 PROTEIN 36 0.82

Q99112 (Q99112) HOMEODOMAIN PROTEIN BW2 35 1.1

Q99111 (Q99111) HOMEODOMAIN PROTEIN BW2 (FRAGMENT) 35 1.1

YA55_METJA (Q58455) HYPOTHETICAL PROTEIN MJ1055 35 1.1

Comments:

TaqMan Primer/Probe Sets:
5'start=822
5'stop=846
3'start=929
3'stop=950

5'primer=AAAGAGAAATGACAACGCAAGTAGA (residues 822 to 846 of SEQ ID NO: 28)
Tm5=57.46

3'primer=GCACTCTTCACAGCCTTTTCAA (residues 929 to 950 of SEQ ID NO: 28)
Tm3=58.24

probe1=AACTGGTCTGGTGTCTGT (residues 847 to 864 of SEQ ID NO: 28)

probe1start=847

probe1Stop=864

direction1=Forward

Tm1=69.07

score1=1.92

length=129
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CT1042 
Nucleotide 

Genomic coordinates : 
Start: 141695 

Stop: 142541 (SEQ ID NO: 30) 
Amino Acid 

MAVNLDNVLVNINNKDEDLTKLVSEAIKRRAKTVFDTKNQAGFDMRRQVEAALYEAISKKK
EKAIKAFDELIQERGDEITPLTTMQYEEWVNRTITPSLTTENLLGDVEHADFLLDRMTPVS
EEDIEGFAASTFKEVSDSKTATVIVKADCETGDIDEVYNLAPSFGVTQEIKIYRSNNSSEL
DNVADSFHIYKISATDSDSGNTKKLLYGLRNKKAGYTCLCRIFAEIESDGIMANTNIGVAE
NNRDEIDENEEGKYGFLIPKQPAGAKLIIYFFLNCWTX
(SEQ ID NO: 31) 

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

gi|2492980|sp|Q10043|YRP1_CAEEL HYPOTHETICAL 37.6 KD PROTEIN R1... 33 0.79
gi|401691|sp|Q00933|YSCI_YERPS YOP PROTEINS TRANSLOCATION PROTE... 32 1.8
gi|1706480|sp|P51892|DNL1_XENLA DNA LIGASE I (POLYDEOXYRIBONUCL... 32 1.8
gi|267570|sp|Q01250|YSCI_YEREN YOP PROTEINS TRANSLOCATION PROTE... 31 2.3
gi|3121979|sp|O07597|DAAA_BACSU D-ALANINE AMINOTRANSFERASE (D-A... 31 2.3
gi|6686325|sp|P71018|PLSX_BACSU FATTY ACID/PHOSPHOLIPID SYNTHES... 31 4.0

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 700 to 1 of CT1042: this
corresponds to nucleotides 141748 to 142447 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=350

5'stop=367

3'start=392

3'stop=414

5'primer=GGACCGAATGACACCCGT (residues 350 to 367 of SEQ ID NO: 30)
Tm5=57.70

3'primer=CCTCCTTAAAAGTAGAAGCAGCG (residues 392 to 414 of SEQ ID NO: 30)
Tm3=57.66

probe1=AAGCGAGGAAGATATTGA (residues 368 to 385 of SEQ ID NO: 30)

probe1start=368

probe1Stop=385

direction1=Forward

Tm1=69.01

score1=1.88

length=65
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CT1043 

Nucleotide 

Genomic coordinates: 

Start: 142610 

Stop: 143699(SEQ ID NO: 32) 
Amino Acid 

MTVLAVYTAPQIKKSKKRKIEDENEEEPVKTLEDFVKGRLLNAVKEKPAEYFELLISADT 
EAALKTAEETALRDFVIENDSVEIDVEEVLEEKPREYVFKLAGATSETLTNTIIAEVQKK 
AALITEEDITIKMLKQFRAANKDNKDGEATPEEKEDFTNNSDLVGLYLNEVVEKTTNIVI 
NKIFPHEMVFERCAILIEDFDTGWTDQAIQIPSNKYKIRLVEGDEPEVFPGDCLDLAVS 
VDKINHVLKISAKNGCENNCFVIIPRFSPVGSVSSMILGSTDQVKPKTFLFLANKNDSTH 
FQFTMDKQHSVGCELDMLIFSERNLRNLPDSKPRPLSDADILASYGKRLGTGVFTTENLV 
DD 

(SEQ ID NO: 33) 
Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

Q26938 (Q26938) KINETOPLAST-ASSOCIATED PROTEIN (KAP) 40 0.022

023332 (023332) CENTROMERE PROTEIN HOMOLOG 39 0.037

Q9ZES5 (Q9ZES5) CTC PROTEIN 39 0.049

AAD45753 (AAD45753) AN0N1A3 (FRAGMENT) 38 0.083

066878 (066878) CHROMOSOME ASSEMBLY PROTEIN HOMOLOG 38 0.11

AAD45759 (AAD45759) AN0N1A3 (FRAGMENT) 38 0.11

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 982 of CT1043: this
corresponds to nucleotides 142640 to 143621 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=543

5'stop=567

3'start=635

3'stop=659

5'primer=CAAAATATTCCCTCATGAGATGGTT (residues 543 to 567 of SEQ ID NO: 32)
Tm5=58.37

3'primer=CTGATTTTGTATTTGTTGGAGGGTA (residues 635 to 659 of SEQ ID NO: 32)
Tm3=57.33

probe1=GACTGATCAAGCCATTCA (residues 615 to 632 of SEQ ID NO: 32)

probe1start=615

probe1Stop=632

direction1=Reverse

Tm1=69.01

score1=1.98

length=117
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CT1044 

Nucleotide 

Genomic coordinates: 

Start: 143759 

Stop: 144689 (SEQ ID NO: 34) 



Amino Acid 

MSSSSSETPKTSTDTGEERIKDIVNALDNNGEWLSSYIDPIINNHISRKTAETVQKINQE 
VDERYDRKIADKINEIKSSIFTSAQTMYDQYAIDTFQEGKGANGTGPVMGPVNTVIDTTL
NKMRGNMLEYAEDMWDGDDWKRFSSSMTTLEFDLSYSDLTMMRGSDGYFAFPFRGTKKIK 
MDGSRKKEDPINCIISVTYPNKVGDEWEEGKEREVNFNLERVDDYERDIHVSILCMLHAQ 
LDNFEQALGENANSFYFKKGQRVMFLPKKSKLFNRPTVEDSDMFSIIFPPASDQDFADDI
YYRIIVTCS 
(SEQ ID NO: 35) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

013706 (013706) HYPOTHETICAL 11.8 KD PROTEIN C13F5.07C IN C 36 0.47 

Q20960 (Q20960) COSMID F58A6 35 0.61 

CAB59514 (CAB59514) HEAT SHOCK PROTEIN 70 34 1.8

Q45851 (Q45851) NEUROTOXIN TYPE F 34 1.8

017208 (017208) C01B12.2 PROTEIN 33 3.1 

BAA83026 (BAA83026) KIAA1074 PROTEIN 32 4.1 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 954 of CT1044: this
corresponds to nucleotides 143768 to 144721 of the genomic reference sequence.

TaqMan Primer/Probe Sets: 

5'start=330

5'stop=350

3'start=406

3'stop=427

5'primer=GCCAGTGAACACGGTTATCGA (residues 330 to 350 of SEQ ID NO: 34)
Tm5=59.57

3'primer=ATCGTTTCCAGTCATCTCCGTC (residues 406 to 427 of SEQ ID NO: 34)
Tm3=59.04

probe1=CTCGAATACGCTGAAGAT (residues 382 to 399 of SEQ ID NO: 34)

probe1start=382

probe1Stop=399

direction1=Forward

Tm1=69.00

score1=1.99

length=98
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CT1045 

Nucleotide 

Genomic coordinates: 

Start: 150687 

Stop: 154344 (SEQ ID NO: 36)



Amino Acid 

METTMDNWQNNDVTKPTPDVATVTTATEKRQSCKEKKDQLKAECPQVLRALKLSNTLKA 
NFGKSMSAIFAQHLVDMTNAKHFKDPKTKKILELDGSSSSDSEEEEETSSSSKRKRGSGA 
RSASSKKEKCPNTIKNWLNDAQGVFRQFADIIINLPSFDDLRDEVKDEQTELKTIYDLYR 
QDMEKWEEVLGRQDLFDHKSEIAKGLARFDTHVSLLPSDRSAVLDSSISKELEKNSKGP
NSNIFDTLNTLKEEIKELLCHHVKYLLQNLTPEDANFVFNSSVKYVKKSYQYYIQTSEME 
SDEFKSLLTGVNIKILEKIISSDNNVATPYKHITNPRN1ISSLQKVRETKPVSKDYPFRV
DTARDIVLLPETGGISDLPIKPVTLLQLVSYINALFSLERRNVFTDGFFNAACVLISQCL 
TNANLLSNDFPKPIELAANVTRHNLLSMKMLQEGSSSEKKSKKKEKKKDKKKGGGGGDDS 
DSETDSSSSSSSSSSSSSSSSSEDEEEEKGEAVEKGKKTKRKTKKKPSKDDDLDTISKLI 
LKTGGYFHDTSELGNKIRNLIDKDDFAGVAQYAVTITEMQSTPMNQRLVSSLLDLIMRLK
EQVKYSVDTESTSSTAKSNNALDSAKLTSQQVVTMMVDSGAELARLAAFFFVVVDNTVVN 
RHEAFILTSKLLPSNENRGLKTVVESFFKNLTISNKVSTSNEEMMSVMPFEDEQQQQQCP 
QHEQQPDLKRVVGEVFLEMGKSIVNSFPSNKSVQLTADAFKQNYSPMGRRINLAAKIKTA 
ISIGSNISPNILFSNLPESVGNNTVTGLRLTNLLKNISQSAQANNIIKNANTLVNNTMDQ 
QNSAAMSILLFPPTSKETSIFPGNDPSSIKLQDMTTMSNLARGFYSIAEGCIGVVRSREF 
DEGGVKAYTLLVDSNTMDMAVNFAAQSLEKSMSEALTNNANMNPSNVLEGGSFVDGALSY 
MFEKNGSDCEPTPLAKYTMKDVSNRYLKKFNNDKNTQDLYKNRAERALVEQVTNKPTSVV 
HSQLANAMGVAVIGAASIKLMEAEAAESEMRAANYQATSKSTNAINITNTIGMIRNTTHL 
CTTIAVSAAADMSKLANNHFMSVLNTANNSHSSRRGDRSSLMLQQQQPTHSAFLEQTRGR 
GGGVLGSGTEQTKDHVERMKRDWILNMISPEDKNTTTTTPSNAGRTLGYGSNITGINTIK 
QDDKSMMDKLSEMSSFRT 
(SEQ ID NO: 37) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9XYZ6 (Q9XYZ6) HYPOTHETICAL 75.5 KD PROTEIN 59 2e-07 

O55035 (O55035) PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.2 59 2e-07

VTA2_XENLA (P18709) VITELLOGENIN A2 PRECURSOR (VTG A2) [CON 57 5e-07

Q07034 (Q07034) RNA BINDING PROTEIN 57 9e-07

NAB3_YEAST (P38996) NUCLEAR POLYADENYLATED RNA-BINDING PROT 57 9e-07

095367 (095367) CBF1 INTERACTING COREPRESSOR CIR 57 9e-07 
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Comments : 

TaqMan Primer/Probe Sets: 

5'start=1624

5'stop=1646

3'start=1702

3'stop=1720

5'primer=AAAACAGGAGGTTACTTCCACGA (residues 1624 to 1646 of SEQ ID NO: 36)
Tm5=57.88

3'primer=CTGCATATTGGGCTACGCC (residues 1702 to 1720 of SEQ ID NO: 36)
Tm3=57.81

probe1=CACGAGTGAACTCGGCAA (residues 1647 to 1664 of SEQ ID NO: 36)

probe1start=1647

probe1Stop=1664

direction1=Forward

Tm1=69.00

score1=1.99

length=97
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CT104 6 
Nucleotide 

Genomic coordinates: 
Start: 154556 

Stop: 156932 (SEQ ID NO: 38) 
Amino Acid 

MSLVENNTQEEMILETTVEGVVEGAEVAPRGVKRPLPSSSSSSSASDSEDDEGGEQPQTK 
PPKKKRNINSGKYWKIETIEPASPEMLSAVNDIDNVSKTIPLIDNSFGVQFKKSVSEEQI 
KTLLTETIAVEYGTITNVKYSTFNQLERTGEPLKKKRSNNGNNNYRYWQIRIEAAAAENV 
TQAVLDAIVEGNDTVIKAILLPEGEGIGLQFNKSVSSQQAKNIVQAADIEFGQVAHMKCN 
LFHKMEKADESSNSSGESPKVKKVRRNKSQPTNSYYTFTMIGDSLQERIDNAIKVIEMSP 
VKRPFSNSAAAAEEDTTTTTTTSTGVVNPRGIKDIHFFDSSISKGCFTVRNIVAANGEVP 
QEEFVSELYTNLLKVEEKVDHPTFKKLIHDRTMNRHIKAWYCICPYYTTGGVPPAADKVS 
AKGIATYRIYEDRTGVFQFDGAHTSTTPAQAAEATGAIHKSMLFQSPGTDIQKFLDAKKA 
EGLEPISSGEIVYRSKWSPNDSRATRCFKFYSSSDEKMNIADVLSIVHTDGLFSSVHFRK 
DTMEYGVAKSKSKIIPKTIKIKKGGDTFHSEEDIEVPVKFTAITSEELNRECNTKGMNSL 
RAHKKRKSNSSTTTTSTTSTSTTANTPKKTKKSASAASDPFAKLTLDYVDSTSFVFYNIS 
KEMVQRILAQERVKTLKAVKNEEKMEIVEGEEAQETYRGIVKIKTNAKAYNLANKTSGVL 
FPADKVCLKHTLEDLGDVLDFDWREDNVNKTVASTTTTSSENKASGGDDEETPMEFETD 
GEKLLHELLNE 
(SEQ ID NO: 39) 

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

Q45759 (Q45759) CRYSTAL PROTEIN 41 0.023
O97003 (O97003) L1156.7 PROTEIN 41 0.030
002061 (002061) B0041.7 PROTEIN 41 0.039
AAD55361 (AAD55361) XNP-1 41 0.039
KI67_HUMAN (P46013) ANTIGEN KI-67 40 0.051
P79065 (P79065) NOC1 PROTEIN 40 0.066

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 935 of CT1046: this
corresponds to nucleotides 155963 to 156897 of the genomic reference sequence.

TaqMan Primer/Probe Sets:
5'start=1338
5'stop=1355
3'start=1419
3'stop=1440

5'primer=TACCCCAGCACAGGCAGC (residues 1338 to 1355 of SEQ ID NO: 38)
Tm5=59.36

3'primer=TGCCTTCTTAGCATCGAGGAAC (residues 1419 to 1440 of SEQ ID NO: 38)
Tm3=59.46

probe1=GAGGCAACTGGTGCTATT (residues 1357 to 1374 of SEQ ID NO: 38)
probe1start=1357
probe1Stop=1374
direction1=Reverse
Tm1=69.01
score1=1.98
length=103
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CT1047 

Nucleotide 

Genomic coordinates: 

Start: 159378 

Stop: 161256(SEQ ID NO: 40) 
Amino Acid 

MSCSSSSSCSSSSEEENEVGVEGGGGRIGPTEAKKKILRKRKRSSVKSTSSSSSSSSSSD 
DSDSDREEKEGRKLYVDIADTRKPPKVRKLDTPSQTLENDLYMSSSSSSSSSSSDSSSSS 
GEEESDDDDDDDYDPDNVHVLGCKKEKSPQDIEAEKEKEEEYEEEFKRMALPSRINTSVD 
DCVIPDRILTLFSTLLKKNSFQFSQPVSFLRLVMKQVNEAMNSAFSSMLSSSGMRLVEDS 
LGDTSKISSFITPQTDTSNSSSSSTFVNNCTDEDIKKRNIAMGRVAELLSNIAASSNEEN 
NFRPVVSLMRGPTCGGSNASNKKLNSNRQTIPQVLNKVIFFREIHSVIALYLSSVCVQRA 
MNNDNTNSSGYAEGMVTKILNIIGKIPYNEMSREKFISVGRDALYLYQNVITDMTGPKHN 
KRLRIPQQQADFCYIIAMLVNDVPITSDLLLTGKATNLVQFASAMVDPAYRLAVHKMASV
FNSSYSVYKVLDLDHKMLLRANLILSILSARNKCLSERKPRTLTQSVYLFLNHLLRNKLR 
SSGLTSEESSLGTAVKLVSQQLMYEGVTRQTIEDGCSMISGNFEDEDGVTLKCLGADVKD 
VKTVGLSALLSDRLRKNIRRNVPFY 
(SEQ ID NO: 41) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

SR40_YEAST (P32583) SUPPRESSOR PROTEIN SRP40 74 2e-12

NSR1_YEAST (P27476) NUCLEAR LOCALIZATION SEQUENCE BINDING P 63 6e-09

VIT2_CHICK (P02845) VITELLOGENIN II PRECURSOR (MAJOR VITELL 61 3e-08

095815 (095815) DENTIN PHOSPHORYN (FRAGMENT) 60 4e-08

VIT_ICHUN (Q91062) VITELLOGENIN PRECURSOR (VTG) [CONTAINS: 59 1e-07

GAR2_SCHPO (P41891) GAR2 PROTEIN 58 2e-07 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 2 to 888 of CT1047: this
corresponds to nucleotides 160277 to 161163 of the genomic reference sequence.



FIGURE 3, Sheet 23 of 160 




WO 01/38351 PCT/US00/28888 

65/201 



CT1048 

Nucleotide 

Genomic coordinates: 

Start: 161717 

Stop: 165020 (SEQ ID NO: 42) 
Amino Acid 

MFGSSANNFNGDKKSSSSSSAAASSDDQQLGPLGLSTADFKKVAAILANRTESLYLLPDS 

PNFKNVINNPNQISIVPFLGSSKAAESGSANKNENQAENSSKGGSDGKKSSQQNKFNLLN 

KVEAEEMAFKRVAELIADTPPSKDNPLRDDPDAIPSRNPWVKLTQKNLEYLFWEAVTIEV 

SNDRSIRSGRYLQASEVGENPFLMTISVDIRILQRMALNWWFFNRFFRMVSGLGVENRA 

NSTYVATSDAIAQIWVEMLLKNFISGENVPQALKYLKEHYEHVYNKISKCGRQPSYFVVE 

FERVDNTIGFVNSDTEHNGSSYMEYRCFDTIRKNASSGPSGGGKSGVLSSGTFFIDNEMG 

NNNSSAAAASAPAVSAGVSPSLSPFSSDGDDDDDDCSGDDVWGKKMIFNTSGDGSGESSG 

QNGGGASTYKRFRCGENTASLSQKENVRLMAMPKGNEDKQLLKNIINFLNSALNSVENHV 

MCTDENIFDEDQAEHYTSNKELYKAIVCSNPANVYRVMVELFVNLILPRLRNPIVSDIET 

VQNLPSNNGSVRTKKMVEHGCTDMRYDIPPYAKGKIRLSAKRACECRKLCKDVRCFDKSR 

EANLTPSQKAGREVEEPFPRNHNSHRSNAHDFTFYDKYRARMNKLKKDSKKKVKKIDTFT 

TTDDFLLQDRNAFDLLRKCFLSASLHHIFCPDVLMVHRGDSFNINFANNKLECYNERNGI 

EEVTSSQTVNAKEALEDITKIKMKRGDDIIDVVKSKGLSLREFSKKVSKIVRRFNEITNQ

LCNNCNVNSSNGDVDFHVFTSVCVYIHNIIPVLEDISIFAELGEELTKLVKECRDVAGED

KTYDDIIRNYEITVKYFKLFNALVKFCHRNYNVAVTSAINRRGYMCMVSNLVGYYCKLSD 

NAIQYHESLCSLHSSISYADYYTSRNNNSEDGGGNSSSEKSNADVAKTMASFYDQFDKSE 

DSKKNKNKTSNEILIKMFQMDRVLDGMDDDDDEDSDSSSSENEEEEEEEEIVKKPAKKRK 

VEDVDSNKKTLPKEPAVKKVKQEEDVEMEEVKEAAAEEEKKEEQEAKEEDATEYDDDTEE

DEKAVASDEDEDDEDSKAIF

(SEQ ID NO: 43) 

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

BAA83091 (BAA83091) HHNBV-XIA 427 e-118

Q9YTL7 (Q9YTL7) ORF 48 88 3e-16

Q18401 (Q18401) COSMID C33G8 77 7e-13

035788 (035788) CYCLIC NUCLEOTIDE-GATED CHANNEL BETA SUBUNI 74 5e-12

096127 (096127) PREDICTED SECRETED PROTEIN 73 8e-12

096229 (096229) HYPOTHETICAL 78.6 KD PROTEIN 73 1e-11



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 109 to 282 of CT1048: this
corresponds to nucleotides 164907 to 165080 of the genomic reference sequence.
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CT1049 
Nucleotide 

Genomic coordinates: 
Start: 170115 

Stop: 170733 (SEQ ID NO: 44) 
Amino Acid 

MAGIGRRDNRPVLHLDIDPNKEIPYNVPPTPIICEKNPFVFNMQKCSDCAPFPPYPGTEK 

PFPPYPGTAVEEEEKQKEIEELLVDQSFPPPFPGNKLRDIPRTYPLEFPEKKEKDFPCVD 

TTGHSDIPFIDLEKTPPRSDVRHGYHYLINPNKVGELNHIVGKLTEKQENLNKLVLDVDD 

VVINLSSTLKELEKLRAGLCKFSKN 

(SEQ ID NO: 45) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

PGCA_BOVIN (P13608) AGGRECAN CORE PROTEIN PRECURSOR (CARTIL 39 0.020

CAB40774 (CAB40774) EXTENSIN-LIKE PROTEIN 37 0.13

AAD45972 (AAD45972) WISKOTT-ALDRICH SYNDROME PROTEIN INTERA 36 0.30 

Q22807 (Q22807) SIMILAR TO E. GRACILIS MAJOR MEMBRANE SKELE 35 0.39 

014686 (014686) ALR 34 0.67 

Q28226 (Q28226) MUCIN 34 0.87 
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CT1050 

Nucleotide 

Genomic coordinates: 

Start: 170831 

Stop: 171461 (SEQ ID NO: 46) 
Amino Acid 

MASSSSSPVALSSVASSVMMERDEENTLSLRNRNVNKPTPVSAAWVPVDEEDEDREEMRR 

LEDFSSDEEDDDNKSCHCDHSDDDDDDEEDPSCFKGFSAGLCSFVRGFFGFLRKSLTKKQ 

VFLLTSAAVAAIFKTRDVAKTEEGAATMEENSTDVITGGDGDSGIAADVVSLASEGEGEN 

GSLLESIATTLIKTTIENLVDGGEETTEL 

(SEQ ID NO: 47) 

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

077320 (077320) PFC0335C PROTEIN 45 4e-04
CAQC_RAT (P51868) CALSEQUESTRIN, CARDIAC MUSCLE ISOFORM PRE 43 0.001
O49209 (O49209) PUTATIVE HISTONE DEACETYLASE 43 0.001
Q9YPA9 (Q9YPA9) HYPOTHETICAL 45.2 KD PROTEIN 43 0.002
Q98148 (Q98148) ORF73 HOMOLOG 43 0.002
040947 (040947) ORF 73 43 0.002

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 637 of CT1050: this
corresponds to nucleotides 170864 to 171500 of the genomic reference sequence.
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CT1052 
Nucleotide 

Genomic coordinates: 
Start: 173177 

Stop: 175862 (SEQ ID NO: 48)



Amino Acid 
MTRHGVLVPK GRSRHVILGN VDYTFCTTDN NCVSLDIDFK DNITDQNIQL
LNKKLGKKTA KKIKKEDAPE TKENSDEDIY ATKEFEQTIK GLQTKKGATE
ENAIAAAAAA ATAAAVEKAM LSESEGKSMV INRARMVLSK RDTSQKQFTA
LKNRESFFSV LIFETGSVIV VGLQDPSLTK LCVIKATTDI ADILQKNISV
ANVSIVNTVS TFNRFHLNFI RLGKFFERNC ISYSYNPETF PGMFFKLRVP
AKPLLPGETI GEYYTKVAMM RDSKDPNFKM SDWLRIKTAL TFKVGKITVL
GEGESGCGDV SVVSKLLFGL FHYFMDNNIK MSPKEAQRVR EKYGIPHLEW
YLYIDMLLHS YPYVKPSAEQ VKRAMVDQQH ISEVDRTYYG TKNSMDAAMS
ANLVPSKEES ISFIKKIRSQ QLFGHLCKPS KETTRRAIDT LSFDPINQDR
WWNKNDQYYG KERCDPFSVA RLVSVSENTN SMMNSRISCQ GKWWLDENEY
KDKLDHIVDL CTEEIVEECE SKGFIASPFL RKHQKEKIPT PYVLLARACN
QKNGNKMSIN NNSNYLSGSS RAKRNAKLQE KHRVTLARLN TMMASYRFLN
NYISTDIAPD FAKLFGNDVY SLLHLMTNLP KSRGHALTYN ERALSSNEST
YKTPGNAYFS TLFEKSIINN QETANKGNNR KRKFSRIGQE KSSFLCNACG
VNLNKGSDEI IKGICTSCDQ NSTSYIENAL SDINRDKKIK RFKAAATHPP
VKQELVDSLS SSSSPSSSSS QTSNKNNRCT PSDFIDYVYK FTDETTGAPK
VGLVFKMCDI LASLASRRGM EDRPTANYRT SLHSATQNKT NLNKLLVSAI
KETGATETEA QIFNKIIGSE KGLSILCQLV ERRNKDNNVF D
(SEQ ID NO: 49)



Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

gi|6647869|sp|O29874|TF2D_ARCFU TATA-BOX BINDING PROTEIN (TATA-... 44 0.001
gi|6647868|sp|O27664|TF2D_METTH TATA-BOX BINDING PROTEIN (TATA-... 43 0.002
gi|3122925|sp|Q12731|TF2D_EMENI TRANSCRIPTION INITIATION FACTOR... 43 0.003
gi|135643|sp|P13393|TF2D_YEAST TRANSCRIPTION INITIATION FACTOR... 41 0.010
gi|135627|sp|P28148|TF22_ARATH TRANSCRIPTION INITIATION FACTOR... 39 0.038
gi|135626|sp|P28147|TF21_ARATH TRANSCRIPTION INITIATION FACTOR... 39 0.050
gi|3334374|sp|O43133|TF2D_CANAL TRANSCRIPTION INITIATION FACTOR... 38 0.065
gi|135639|sp|P17871|TF2D_SCHPO TRANSCRIPTION INITIATION FACTOR... 38 0.11
gi|2833518|sp|Q57930|TF2D_METJA TATA-BOX BINDING PROTEIN (TATA-... 38 0.11
gi|417896|sp|P32085|TF2D_CAEEL TRANSCRIPTION INITIATION FACTOR... 37 0.15
gi|417882|sp|Q02879|TF22_WHEAT TRANSCRIPTION INITIATION FACTOR... 37 0.19
gi|121568|sp|P22010|GR78_KLULA 78 KD GLUCOSE-REGULATED PROTEIN... 36 0.25
gi|3122941|sp|P93348|TF2D_TOBAC TRANSCRIPTION INITIATION FACTOR... 36 0.25
gi|3915894|sp|P52653|TF2D_ENTHI TRANSCRIPTION INITIATION FACTOR... 36 0.33
gi|1729908|sp|P50159|TF22_MAIZE TRANSCRIPTION INITIATION FACTOR... 36 0.33
gi|1351224|sp|P48511|TF2D_MESCR TRANSCRIPTION INITIATION FACTOR... 36 0.33
gi|1729907|sp|P50158|TF21_MAIZE TRANSCRIPTION INITIATION FACTOR... 36 0.33
gi|135640|sp|P26357|TF2D_SOLTU TRANSCRIPTION INITIATION FACTOR... 36 0.43
gi|1709903|sp|P54637|PTP3_DICDI PROTEIN-TYROSINE PHOSPHATASE 3... 36 0.43
gi|2833459|sp|Q55031|TF2D_SULSH TATA-BOX BINDING PROTEIN (TATA-... 36 0.43
gi|586175|sp|P32623|UTR2_YEAST UTR2 PROTEIN (UNKNOWN TRANSCRIPT... 35 0.56
gi|2833446|sp|O52366|TF2D_PYRKO TATA-BOX BINDING PROTEIN (TATA-... 35 0.56
gi|3915739|sp|P18428|LBP_HUMAN LIPOPOLYSACCHARIDE-BINDING PROTE... 35 0.56
gi|135634|sp|P26354|TF2D_ACACA TRANSCRIPTION INITIATION FACTOR... 35 0.74
gi|135636|sp|P20227|TF2D_DROME TRANSCRIPTION INITIATION FACTOR... 35 0.74
gi|417102|sp|P32103|H1_EUPEU HISTONE H1, MACRONUCLEAR 35 0.74
gi|2833477|sp|Q57050|TF2D_PYRFU TATA-BOX BINDING PROTEIN (TATA-... 35 0.74
gi|3041729|sp|Q03410|SCP1_RAT SYNAPTONEMAL COMPLEX PROTEIN 1 (S... 35 0.74
gi|3122946|sp|Q42808|TF2D_SOYBN TRANSCRIPTION INITIATION FACTOR... 35 0.74
gi|126745|sp|P14873|MAPB_MOUSE MICROTUBULE-ASSOCIATED PROTEIN 1... 35 0.74
gi|4033393|sp|P78695|GR78_NEUCR 78 KDA GLUCOSE-REGULATED PROTEI... 35 0.74
gi|126511|sp|P12744|LUXB_PHOPO ALKANAL MONOOXYGENASE BETA CHAIN... 34 0.97

Comments:

TaqMan Primer/Probe Sets:
5'start=1268
5'stop=1291
3'start=1327
3'stop=1350

5'primer=TTGGACATTTGTGTAAACCTTCAA (residues 1268 to 1291 of SEQ ID NO: 48)
Tm5=57.29

3'primer=CCTGTCTTGGTTTATAGGATCGAA (residues 1327 to 1350 of SEQ ID NO: 48)
Tm3=57.45
primerScore=0.74
allele1=

probe1=AACTACTCGACGTGCTAT (residues 1296 to 1313 of SEQ ID NO: 48)

probe1start=1296

probe1Stop=1313

direction1=Reverse

Tm1=69.02

score1=1.97

length=83
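
The CT1052 primers can be cross-checked against the amino acid sequence (SEQ ID NO: 49): translated in the coding frame, the 5' primer should spell a stretch of the protein, and so should the reverse complement of the 3' primer. The sketch below does this under one loudly stated assumption, namely that residue 1 of the nucleotide sequence (SEQ ID NO: 48) is the first base of the start codon; the helper names are illustrative and the codon table is the standard genetic code.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(seq: str) -> str:
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_in_frame(seq: str, start: int) -> str:
    """Translate the complete codons of seq, whose first base sits at
    1-based position `start` of a coding sequence beginning at position 1
    (assumption: position 1 is the first base of the start codon)."""
    skip = (-(start - 1)) % 3  # bases to drop to reach a codon boundary
    codons = (seq[i:i + 3] for i in range(skip, len(seq) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


five_prime = "TTGGACATTTGTGTAAACCTTCAA"    # residues 1268 to 1291
three_prime = "CCTGTCTTGGTTTATAGGATCGAA"   # residues 1327 to 1350

print(translate_in_frame(five_prime, 1268))
print(translate_in_frame(reverse_complement(three_prime), 1327))
```

Both peptides can then be searched for in the listed amino acid sequence; a primer that fails to translate into a matching stretch suggests a damaged character in either the primer or the protein line.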
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CT1053 

Nucleotide 

Genomic coordinates: 

Start: 175839 

Stop: 177107 (SEQ ID NO: 50) 
Amino Acid 

MSSTDLSKNAFHDWVVSKTDCEVFDVHCETDRDCGAACENTYSVDGKEVTKFSCNQQSGR 

CARSVYSASSLERAANDLGHIIGIIKKNPKLEEELPESFLWFINHNGGDLFVNKRAAYYD 

TMHLSIGKLDNVDTLAQGLDKRMASSLREHLLRKLDSILLQIDKVKYEKAKKWILDITQE 

AGTEEDNKEEEDAKKEDQSLSVSEIVDVLTGTHDPMPLRARGFIQKKIYPLSRNELRELA 

LKELFPEETTSPQVLSRQHDVSTREDLCNESMNAGRAESIFSDPDSGEYVATCACLYSEY 

LTGPACKHKTYRYVIDYDKWKRTGRPEFLTDPVLHFKKAEAVCKSTNPNLRAIYSPDNKG 

FLCAPVAELVKTALTFRGSHEPSLIVERDINQAENLPSNSFGVNWPYVNLLNRIQDQYT 

(SEQ ID NO: 51) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q26648 (Q26648) TEKTIN B1 37 0.30

RRPP_VSVSJ (P03520) RNA POLYMERASE ALPHA SUBUNIT (EC 2.7.7. 36 0.52

RBB1_HUMAN (P29374) RETINOBLASTOMA BINDING PROTEIN 1 (RBBP- 36 0.68

CENE_HUMAN (Q02224) CENTROMERIC PROTEIN E (CENP-E PROTEIN) 35 0.89

RRPP_VSVIM (P04880) RNA POLYMERASE ALPHA SUBUNIT (EC 2.7.7. 35 1.2

Q89487 (Q89487) PHOSPHOPROTEIN 35 1.2 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 1109 of CT1053: this
corresponds to nucleotides 175933 to 177041 of the genomic reference sequence.
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CT1054 
Nucleotide 

Genomic coordinates: 
Start: 177123 

Stop: 178524 (SEQ ID NO: 52) 
Amino Acid 

MSASLILDEYLKKTASAVLDVADSFEKIKGEIQSPEEAAALSVALYGAPPKPSASAVASII 

TGERTSLNDKYLSDNVLLKMSVARVGQENNRKRADQAADEIRTIMEDITGSLSGAYRQYSP 

LEEENKVHIGIMNNKTPSIVCGYYTMDTSISSEPLSLTDFQNPTVIANVTKRMESIFSKVD 

SARSTRFDAFVNGVANNMDIKSSIDWANMVENVIKLPDSTPNPCSVDTIVSRDASVVKTAV 

NDIYASVGKSYCRPATQLTFMSEIEKLRKAAVVCFEALMSDTRERAFVEFLFYVSFKEDAS 

NTNSKLFVQNKLSSMSGNPRQPIKLVRRSAEETLFGLCFMFKVMPPEFMNCIFNFPTIPHS 

TQYHGLYGTCLTPLLRKYGSSFEKSWAHFEEILSERANAVKKFGVNDTRIDCLDAVANLTG 

PVYVLILDLVRTLSAQRSCSTKFLREIKENYLLWNRFVSX

(SEQ ID NO: 53) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

gi|126194|sp|P17279|LEU2_RHIRA 3-ISOPROPYLMALATE DEHYDRATASE (I... 33 1.4

gi|6685598|sp|O95613|KEND_HUMAN KENDRIN (KIAA0402) 33 1.4

gi|2497227|sp|Q04893|YM96_YEAST HYPOTHETICAL 113.1 KD PROTEIN I... 33 1.4

gi|134393|sp|P17065|SEC2_YEAST PROTEIN TRANSPORT PROTEIN SEC2 32 2.4

gi|547908|sp|Q02455|MLP1_YEAST MYOSIN-LIKE PROTEIN MLP1 32 2.4

gi|2496893|sp|Q09462|YQ52_CAEEL HYPOTHETICAL 30.9 KD PROTEIN CI... 31 4.2



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 899 of CT1054: this
corresponds to nucleotides 177561 to 178459 of the genomic reference sequence.

TaqMan Primer/Probe Sets: 

5'start=678

5'stop=701

3'start=754

3'stop=777

5'primer=CACCTAACCCTTGTTCAGTTGACA (residues 678 to 701 of SEQ ID NO: 52)
Tm5=59.08

3'primer=CAATAAGATTTTCCAACAGAAGCG (residues 754 to 777 of SEQ ID NO: 52)
Tm3=58.08

probe1=TATTGTGTCCAGAGACGC (residues 703 to 720 of SEQ ID NO: 52)

probe1start=703

probe1Stop=720

direction1=Forward

Tm1=68.92

score1=1.92

length=100
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CT1055 
Nucleotide 
Genomic coordinates: 
Start: 178529 

Stop: 179348 (SEQ ID NO: 54) 
Amino Acid 

MAQTSKMGTNKRCFEEEVEEERQQPFTKKSKSEPPSFEDKSSSTSSKKKSKSNKHTKTKE 
EQLLEFVKDLERSDPTVPDEKVKQEVEEKSPEAIAEIFSMFGIAQDSKFKSLLPIERIKS
ITTKIVIDAINQPVRKMLVDHLYHFKEMQNVVEKYKDDSDEKLSVILKSKKSPKEFDLSF
SDYVDRLNRILVGVIKRVAGAIESKELLQSNSMIMNSVLGTVVSNIPYNMKINICVFLTN 
FICTFANDDLYTFFRDDEKFVMSQVTRYISKD 
(SEQ ID NO: 55)

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

001761 (001761) C. ELEGANS UNC-89 (GB:U33058) (NID: G1160355 45 8e-04
Q17362 (Q17362) UNC-89 45 8e-04
Q17595 (Q17595) SIMILARITY TO MYOSIN HEAVY CHAIN 43 0.003
031329 (031329) ERPM 42 0.004
FKB3_YEAST (P38911) FK506-BINDING NUCLEAR PROTEIN (PEPTIDYL 41 0.007
AAD55361 (AAD55361) XNP-1 41 0.009

Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 1 to 661 of CT1055: this
corresponds to nucleotides 178612 to 179272 of the genomic reference sequence.
Hit to public sequence gi|6856160|gb|AF173992.1: CT nucleotides 647 to 819 match
nucleotides 910 to 738 of the public sequence with 100% homology, a score of
343 and an E value of 6e-97.

TaqMan Primer/Probe Sets:

5'start=269

5'stop=289

3'start=340

3'stop=360

5'primer=CCCCTGAAGCTATTGCTGAAA (residues 269 to 289 of SEQ ID NO: 54)
Tm5=58.07

3'primer=GCTCTTTATGCGTTCAATGGG (residues 340 to 360 of SEQ ID NO: 54)
Tm3=58.32

probe1=AAGTTCAAGAGCCTTCTT (residues 322 to 339 of SEQ ID NO: 54)

probe1start=322

probe1Stop=339

direction1=Forward

Tm1=69.00

score1=1.88

length=92
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CT1056 

Nucleotide 

Genomic coordinates: 

Start: 185432 

Stop: 186830 (SEQ ID NO: 56) 
Amino Acid 

MAVGDYLSMSSVGEATLVGFMILNFINFVTILSLIIYAVTDVYRRCKRPSTNGYSGCTTN

VVSSTLQEANLVTTEKDKPVQFVRGLVPRKMMEKYRSDLSPKNVGEYILPSEKETDKLKS 

DYKKGKKVGLLTALSNGHDSNKRIIGPRDLISRDDVKDKSYVFKRLSKDPLVYYSSATSK 

WRKFSPFRAKKFMTSTQLGSKLVYPHPIRYGTAFVLPTGYVINKAYGMDNEDLHTWNPP 

SSSVLVPDSNNDRLTVECAKTDPTHRIGIYGFGGSDDNRRAKEEGYVEMLLCNCDNHKDL 

LKAPLITEYSTNPTEIQVDVAAKRVLFPAPGSEPVKSSQVTSAAHQLDGATGEHDISHEP 

VKLSDTGDYAVGSPIVFKPVYGTSLVNLPETGSPLALNCPCTDKADGIYQVNQKGGILYR 

DMVGYLNANPVEAASLSSSDSSSWLTTGNKISSVTCEGEKIKKIV

(SEQ ID NO: 57) 

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

Q99175 (Q99175) HYPOTHETICAL 58.5 KD PROTEIN 34 2.2

CAB49723 (CAB49723) HYPOTHETICAL 52.6 KD PROTEIN 34 2.9

Q9Y6X0 (Q9Y6X0) SET-BINDING PROTEIN (SEB) 34 2.9

Q9ZXH3 (Q9ZXH3) INT44 33 3.8

Q55105 (Q55105) MULTIPLE LIGAND-BINDING PROTEIN 1 PRECURSOR 33 5.0

Q9WXH8 (Q9WXH8) PYRUVATE ORTHOPHOSPHATE DIKINASE 33 5.0



Comments : 



TaqMan Primer/Probe Sets: 

5'start=663

5'stop=684

3'start=705

3'stop=725

5'primer=CGTGATCAACAAAGCATACGGA (residues 663 to 684 of SEQ ID NO: 56)
Tm5=59.59

3'primer=GAAGAGGGTGGGTTCCAAGTG (residues 705 to 725 of SEQ ID NO: 56)
Tm3=59.41

probe1=GGATAATGAGGATCTACA (residues 687 to 704 of SEQ ID NO: 56)

probe1start=687

probe1Stop=704

direction1=Reverse

Tm1=68.99

score1=1.88

length=63
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CT1057 
Nucleotide 

Genomic coordinates : 
Start: 190875 

Stop: 193236 (SEQ ID NO: 58) 
Amino Acid 

MEYMEEGDIAERRSEGVDYILDENSACVVNVKSIRNRLGAMDAEEAQYAQDISAQLVTHI
IRLAHCSESNKIKDTIASIAGLFINNIFDNNSTKNKLKTYNQFKAESQNKSSVLNIFGSL 
DPLSMLSSFMGSDPAKSGGENLDKSLGVLFEVLQNYNPCKIDDIVLLEMCPSKCAACTGL 
KEAIRQEQPMEAMLLFFKCINHNRFNFGSDIKSAYASETCMRYSQDERAVWPLRSILLG 
CLDRDDPAHTLSSFGDTIEYADSDNAWVSSLFAAVSRMPMVDRAVIAHFYVYTMLSRHRR 
VSGDSFKQFVYTVFVRMIYSAIEILFCDTENSSVECDGKHFLSYVNAMVNVSVLGSTFNV 
LKAYRSWWDQASVAPVLDIISGGWKKNYPSPDHIKRVAYDISQVINHLASPSRMVKGNN 
KASNVTSGLDSIRSVRQAEKYIPFGILENKAGYGVINIAKHNISRPAREQSNGRNFNCNA 
LHILPSIKGCEALGAQKGSADQTVNVFDNFVASHMDIAMKKQGSGKILGLLTSMIDRQGL 
TTSFPSSEAEYKKRIHDFTRYVIFSSTPINDELVNSRCILPHSNVLNSPISLRNIDPESV 
PDTRFHFLLMMWQRPNIDEPNLSALTTSQLELLLSKNQKWDKLTTRAFFNIDRINFQMAD 
AI IKNVSGSGFLDGSKTAS SSSSAPN FFQI FSGAECTAKQLQS IRKFI GESMQHVQKEWS 
SAVNNGNRGVENYDGLNAQFSEELFELLYKLI IEEDMRPSSLIASSEFLSNYVNAMDELL 
I RAN AS 

(SEQ ID NO: 59) 



Top Blast Hits 

Score E

Sequences producing significant alignments: (bits) Value

CAB52133 (CAB52133) RCOP C7 (FRAGMENT) 34 3.0

IDHC_SOLTU (P50217) ISOCITRATE DEHYDROGENASE [NADP] (EC 1.1 34 5.1

Q23615 (Q23615) ZK822.4 PROTEIN 34 5.1

YSW1_CAEEL (Q10017) HYPOTHETICAL 63.8 KD PROTEIN T25D10.1 I 33 8.8



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 636 of CT1057: this
corresponds to nucleotides 192639 to 193274 of the genomic reference sequence.

TaqMan Primer/Probe Sets: 

5'start=1214

5'stop=1236

3'start=1281

3'stop=1303

5'primer=TCATCAATCATCTTGCATCACCT (residues 1214 to 1236 of SEQ ID NO: 58)
Tm5=58.13

3'primer=CAGACCTGATACTATCCAGGCCA (residues 1281 to 1303 of SEQ ID NO: 58)
Tm3=59.10

probe1=AGGCTAGCAACGTTACAT (residues 1262 to 1279 of SEQ ID NO: 58)

probe1start=1262

probe1Stop=1279

direction1=Forward

Tm1=69.01

score1=1.98

length=90
CT1058

Nucleotide 

Genomic coordinates: 

Start: 209615 

Stop: 227849 (SEQ ID NO: 60) 
Amino Acid 

MDQYPEVRDTPQTEQEQAAAQQQQAATTTAAAAAAAAPTQYSNTVSAETLSAISEDGKLE 
RSIAASCWINNLNPDEKMAQRVQFHPLSSTTTYDSENVNPGSSVVFLKPRALPTGGTCLA 
PNYIAVPTLRAASEIIDSIASTSLYQCSMFNSWNLIPI FMSNSKHSQFGDRVIKRSMIRN 
CFSKQKNVENLLKELRRRKVNAAKAFSHAVQQKSAVNTALAAWNAGSAANLEKLVDFCKL 
KYSPDRKYKAGGLFSASATAQSQSGTSSSSVEHTSNDFLLDILKRHKGTSLDLDSATNTF 
DTALSRVFTEFKEQARAAVDAAADSDHLSASDPIFSI VRHNSRREGILDSVPNIGMLAPR 
SKYSVAEYLMADRDESADIAAKIGTKIATDFEALRGDNNKRRADTSVDDLKESLADSIEK 
TSIKNTGDINSVTNIPTDTEEYEFSLHITQLFAQAFLETMGSLLSCAFGVQFPFSDEGFA 
AIERIIRKTDPDTGKVSEMDPSSLSDQYLLLVGNFQVSPFHVSDPKDIVFGRQVTPNTPI 
LSIITRSKNDKNETSTIINFRDRLLVNDTVLRDATQNVSTSTPSQRRVPTAAGEPKKPML 
SGCLPIIRGPQVVTRESDDMISGLVGDWYISLGVYYAMGSSAAAIAAGHQRALASAESIN 
S PMMKKFSKKGGKYTEEEKR I KKAMRRN ADRS ARI LALLGQT DAQYG Y VEHNS T L DS FW S 
SNAAIRAKAKEDALSRAEILAVRKQLDGKCSSSRDEYSMVERYLRDSFFRSVNRSGGGYE 
M FDQG FDMGRFADFLS DNS AARN AWQQ Y AE VMRGLS KHEKRV FN I EGL FS ALN S FKFPLV 
PEQGRKKTVGGRHRLNNLKAANKIINGITEMTLQSAI DGTGISDIIGSVSDGWGNTTAQP 
SRVKALKTLSN FSGN GNWS I PVSRA VKCAAGS RGGE TLKCV D I PSV 1 1 ANL I S DKR I L D 
QLCGGGMNLAHEITNFIETIAGKEHTGKESVFLSPRLSVILLRYIWFNAAWSLTDSNIK 
MPLNTMSEGTGDDIYRDYIiAIRGMVNNYNSSLSSISVKAISDRYNCGSGNTSTSNKNVTI 
KTQGELLTVLQQTANALSAFTNKGGVGATPDAANMANVISPIANADVVKNTNVVVSGLDR 
ITETINFFSFLSQIKTMNENIEEYLRRYRLGEGLDKKELDNFVYPNIAAIVKRELGVSGS 
ALSSNLDTDRPITIDLNTEQPLI VKASKGYASNRYAKLFNKTTRTAAEQAQMEQYNAQMA 
ANTI PQLVNRLT I PGS I TADTAI NVVKA FTENGEFSNAETHLGVMGN A I NEMQPLFTDGF 
NVANKRLTVNVGSVSKLIQNGLTVSLILAHSKASPYVFKPLVQDFAKLLLAVTAETSLVV 
SRSQKSFFPIPPSVFSSGGLFKI DREMFDNMKTDYVVEVIRQLSKNATAAIERCNDSDSA 
ARIAKSGEIYNKDVASTTAAPGTSSSALTLFANNLQNPAKVWSMGALPHFDMAWPKLHG 
ISHDQMFRLSTYYQGIHKMELNSDCKPEEWDNSLPGNRASKFFGLSSVSDNNRSFNLALD 
TLLASPAEICDLVTREMVKTSNDIVHNIGSNSNTDALQKSLQVGASAVEKYDESTLSTKE 
TDVYSLVSALAKSKS PLSSSSSLSSEGHLTSKEI DRTWNTPALLGTAKTTSYSVSEDALN 
APLSAVLDFRRNWDATKSLYEVAAVCSVMSKEEDVRSSSRKIMGMVEQESPVMQDIGID 
RIASLVSTVATPKQHRRFLQTVNDYKNYLIRKVASNPLLSSRLGGISPTSGNTDYNLKAV 
YDGVVSSSSSMTPSSMSVSDRFWSGVFSQCLETGPSMFADAGHGGSNMFQITAPKLYGSR 
VNTY7UVLSSGVERLRDSISSATQERKNRIAKSIEALETFVTDVVGGDTLDQLRKAQNMYN 
KLSDITSNSI YSDFGNIDCAKIMKNVTSKKMTARQQSDTILSSLLHELAGLVHKQQPQLA 
TQFALAS HV I KAKYVTN DLNN I HEKET FSQLM AV AGV A DY YN VSAAAMCQRLVAS DVTMF 
LGGTMLQQGLFVSFLLNNVLFSQVSDNIKMNELNDETKSLLVKLVGFCGTVSDALGSRHV 
SSIRRVQNEEDKKLDRSFVTSLYSAYRDLRKKTELYRETDTINKLFGHQNFMSYESSMLK 
RTSLVHDAVSGPRPRRYSTLEDVLEAPSTVHKSFMVSYPERAAASRRVKRAGLRALADNR 
MESLYGEEVLNDMRSSAVSSEMMDIEYGEGGFMMMISDDEDDIAFIDSEEESESSTDFSS 
SDEYSDSSDEYDFDDDNNGQSPYSTTSYSYDALDRLNSAAKPLTAIYGCRGEGEDDEEND 
LYEEEQERRRRRSSKMGKILRDLHESDDDDDDYFDDEFDGERSMSETIATRRAGRIQYGP 
GFLSHSNILNRPAKARAFLTRGKKFRPSAYDRFFMEDDDSLLFSDESTTSSSSSDSPFSS 
FSKGRKCKRRTSEDQCAFVKRVVRAFVPTRVTMINGRVSMITPVTSENTVGFYENYQKAN 
KRERARLIEEYKIVKGASATLPDEYVEGRASKQVSPRELRRSLIKAAAYVARTQESNLNI 
IFDALTTTSNATLVNDPSTLLGDTLLFAKQLEAITERRNRLMKDLTEISPSLFTSFGDAS 
KDTQMMADAKQIVSGGNFKSAGYLGVPLRTLASCIKGTNTVDRLLATECNKNHLEWMTTAA 
IVFARSFNDTTFHALEDTLKMTSALTDMYSAFTNLVGSEHSORLKVKSTLLDSI FNTEIMA 
HTEAVMGLVYPTAFINHEMPSDYTQRREMQSLALNILRGVNCSQLPRKDIGDTAGLLTFI 
TSRKFAGYGGERGGLSLYRMSIVDALSCPSDNRLKGAVSLEVGKWQDMGEEIFYKRSNDL 
VDFCSKNNISLENAVGPIARFVPNGTNMADIGMTDI I SRTVKDDASMIRLRRAEEGAGAA 
GKFITASAMGNLYGGIDTWNLTEKLYDSFVLLQDSDSFNTPTEMATAIINRMKSRKHKA 
LKTPFGGDIATYKNFPSSSEAI VVRAKEMRNSISTI VMDISKSRGINSFSSRSGSTLAKI 
STSEFERILETSAVLSNTECANLRTIENRLAEHYNKLKQFSHISNDGLSETRAVVAVIAES 
LTPVYADDTSERGASVSELLTDNTLLKFIVQNELKNI EEAKRHVTAAIEGSSQLHEKMLS 
LLVASADINRMSAQNNLECKKLTEGNSNFVPMTNDQGGTFIKHKETGIWLKTDEENNTSS 
IKDNDORRVAKTILAIVEDNRNATIRSRLQSLCFGKYAMNDIFALDDADIKNMDKLIEKL 
GEAIAEKASPSSSAISSSSSSNTTSSSSSPSSSPSSSSSSFSMDYSNNLAKTIPYMPIVF

QNKQSNVNSSDASSSSPSSSSSSSANIDNVEHKKVALQQLQTQESNDLSNVLSVTTKHRF 

ASHNQAATVGIFNGRQHAETVVAIPNANKANNNATVSAGQGILTRFSAPENVSSTSMQLP 

PSSSSSSNGDDNKVPVTVRLNQYANSILSSIENASEFKDLKEAERKIDLAIQAASTTETK 

EMVTVSKCPSANQTAITAISQAKSLKKSALELLERVIKAVEVYTPDSSIAAVSLPVNGDS 

MVSSSSGSGSAPSSSSSSSSSSSSSNVTDYFNYAYGKLKNIDENTEEGAETVQKNMVEQD 

AAVRIPLLVSYAPFSEMMRRAIDKLNEYYQLIDAIKTKIVSDTKQASSWAIKETDKELDM 

DPCEQVISKINNLQQNFSNESDKIKMAISVLDNKRNELELQNNKTRSFIETTKSRIEAGGG 

DVANFKEII DYENTSENDNNLFQSLKAFAADNSGTVYTPTDMSNGRDTKSDSKFVDMYNK 

QILEGGIKLINEGQNTVKVDFSKALEAFPRQSNGASEPVSSSVVERRQRERLQAVEMFMA 

IMMERTESLRKRLADSAAQWNTVNNVEETVNSGMVNIKSERLTEIRNQAQIAESTALNSI 

NDEIVESPLTLSLGARVDQLLIKVDRVGSIQQQQQQQQQQQQLPKLTATEQRKEQQYAAD 

RVV Y DPSYTC FLQPLHETI KRI SSVYNSKNKGPLSNTRGVPTS DADLQLMTI T DLSRSVL 

DSSSTSSKKMLYENVPSSIVPGLCQQCAMMITNVHEATHTSPHSFNFENKRSLKQLTEML 

NAATSSSDGPAVRHDVLTMLESNNGYVKDFGFTHRQKVACITPVNTLLGGTFSGNVAPNT 

VILPTSELFNCPGVENDKFRSMVNRTTDKNVADAPKSSASIVETLARTSPNAEHLYFPFK 

DQRRH FNSITDAIISGMSGESSSQLNTTCDQNLVNIDQTTGFPVFTGRKQGERRIVHTEN 

TMEGARKDKNSGIPSCTKDRQTYIDMGTKFMVAPGSLLNANKEETLRLNRLSDINNVRHY 

GTDVHVAGANSAWRIGEVVRAASSFPDGDKESAMKKMLLLGSVSAISAQKSASHINDPTA 

LLSTNTSIQNLVKEAFPDPVCSSNYLGSAESTFATQLAYRQRLFPNGDDENVTTVSNICP 

MDLMGSTKRYNDAFNNIFGSKMTSTNKKGSNCENLLKSAMSNVPAINTAFGAFEEASSSV 

RNRLSPLYEDSTKYSSNQLAVQAMTDTAVDALSAVSTVVGRQNGRNTLLSLPTSITSIAT 

SGRPSLSYSSDMKSNLIKTISRINRDASLLSMGDSQVAAGSSFFNSFLRSSSIPVTTSQD 

GNVAAAEIVLGTILDKTVEINKRFEMLGGGKMVAGSPEARAIQRNTMSSILQMNENELAR 

DLCE I ENKI ETRQLRDAFQDLKRSMLMTPGGVGAISSGASTNNVPLSLLMSRVDASSGLL 

MNNNSANVMEAVDSFNTTPLLVRHMMLDSGKSPVPMAKEIRSMLTQPRALTARALLSESS 

PLLTEICLYNTRDTQPERAVDRLLTSAYLVKQAKRFDGVDPAFPAALTCASHLMLSSMDS 

HTKSSFMDNIKLHMT DTQCFFKNIERFEKFLGRYGDEYAMSHKQNCNCPFHLHHTFTPSD 

NEHLVSSFAFARPEVSMEEIRATPYQANKLISDKHYVMNMSKIDSRVTGSSLLKKVSEWT 

EMRMNSNFNGTFEPSRLALSNSGMTTAGVNLDVIVKPNNARSVLGILECHRQHVCTADAK 

GTV AS AMP AV FQAT DGNGNE SELIQN ALPRNRY IQKS TMN AQT VV FAN VLEQL I ADLGKV 

IVNELAGTIAESVPESVYENTKEMIDRLGSDDLFKSNNNGGVESMDYEDSETTSNNGPVL 

ISEAMKNAVYHTLISGKAARPENVPFASCASGPLAFDFLLSKGDTFEEKNAEQGAAAAVS 

STYSSSSNTTLRKHLARVFEAISKQVTDAEFKDILNDIERNISSDYTNCPPNTNQNAFAL 

AIKREFSRIVSFLTILRKNITPALVDPKGALHEKVAIYLTLLSTKSKLENFFQYGLSNSS 

SVDLSHLKPINCSNNVKNIEDTFMYRNVHPILIMALPENFTALLQQEQMDPDTAIESRRS 

LTTFLN H PNT ASMANGARAAVGAGGGN PMGLYLSSH I LHES TVTTSN P VTDTTEN VN YHS 

SVTQDPVMVVNPFKDSARLIVNNNNTGIDVLNDKSCNYLQVSMPSESSGLVTNTGCSSSS 

SSSSSDTFKYVRRDNTPVNLPRVTPAVLCSDASSNLLDVFSRADIVLENMNVRFGFMPEI 

IAAVSKFKGLTKEEVIKQMVSQNNINNNSNNNNGNGKKTTVDPVTGDIVITNATFPDTRP 

LYTAANGGTSSFKWGDINDRKMHAKAFPTFFIGNPTAAATANGVPLTSEGISLTEEKRKK 

IAGISEGSIGTGALRAAANTRLSSDMEPVMKGWNNIVQLQQTFKKASDKLTHLLRSGGIP 

PRSQETNAIINKMHDSFKTLEECRRVIQDEAALLVATSDLLTGGYGGDAALAMVSPVRPE 

MTGL I GAI SA PVRG I SHLLKLGGVSAANAAI RKRLNLPTSNGKTLPEHG I VHKSAKTLLL 

DSDSISNLYNTDLQDVVSNARDNNNLGRIMQSLGLKGNNAGDLVYSARQLTDLITVPEYG 

NNRDLTKRQAILKMLISNPEILENVADTIYLTTGKNALAPVSAQEMACASLTVGGSGGGK 

LSSDDNVQSLNRLYFRV 

(SEQ ID NO: 61) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

SR40_YEAST (P32583) SUPPRESSOR PROTEIN SRP40 80 5e-13

O94317 (O94317) SERINE-RICH PROTEIN 80 5e-13

O95815 (O95815) DENTIN PHOSPHORYN (FRAGMENT) 80 6e-13

YM96_YEAST (Q04893) HYPOTHETICAL 113.1 KD PROTEIN IN PRE5-F 76 9e-12

AGA1_YEAST (P32323) A-AGGLUTININ ATTACHMENT SUBUNIT PRECURS 73 8e-11

Q9Y076 (Q9Y076) PROTEOPHOSPHOGLYCAN PRECURSOR (FRAGMENT) 71 3e-10
Comments:

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 704 of CT1058: this
corresponds to nucleotides 227240 to 227943 of the genomic reference sequence.

TaqMan Primer/Probe Sets: 

5'start=8829

5'stop=8848

3'start=8894

3'stop=8917

5'primer=CATTACAGCCTCAGCCATGG (residues 8829 to 8848 of SEQ ID NO: 60)
Tm5=57.78

3'primer=GCAGAACGAACGAGTCGTATAGTT (residues 8894 to 8917 of SEQ ID NO: 60)
Tm3=57.76

probe1=GAGGTATTGATACCGTTG (residues 8861 to 8878 of SEQ ID NO: 60)

probe1start=8861

probe1Stop=8878

direction1=Reverse

Tm1=69.00

score1=1.99

length=89
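
Each primer/probe set in this listing pairs an oligonucleotide sequence with a residue range, so the entries can be cross-checked mechanically: the sequence length should match the inclusive range, and the probe should lie between the two primers. The sketch below does this for the CT1058 set in Python; the `Oligo` class and `probe_between` helper are hypothetical names introduced here for illustration, and the "probe between primers" rule is an assumption about how these sets were designed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Oligo:
    """A primer or probe with its inclusive residue range on the nucleotide sequence."""
    sequence: str
    start: int
    stop: int

    def length_matches(self) -> bool:
        # An inclusive range start..stop covers stop - start + 1 residues.
        return len(self.sequence) == self.stop - self.start + 1


def probe_between(five: Oligo, probe: Oligo, three: Oligo) -> bool:
    """True if the probe range sits strictly between the 5' and 3' primer ranges."""
    return five.stop < probe.start and probe.stop < three.start


# CT1058 values as listed (SEQ ID NO: 60)
five = Oligo("CATTACAGCCTCAGCCATGG", 8829, 8848)
three = Oligo("GCAGAACGAACGAGTCGTATAGTT", 8894, 8917)
probe = Oligo("GAGGTATTGATACCGTTG", 8861, 8878)

for name, oligo in (("5' primer", five), ("3' primer", three), ("probe", probe)):
    print(name, "length consistent:", oligo.length_matches())
print("probe between primers:", probe_between(five, probe, three))
```

A mismatch between sequence length and range is a useful signal that a digit or base was lost in extraction.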
CT1059

Nucleotide 

Genomic coordinates: 

Start: 228374 

Stop: 230564 (SEQ ID NO: 62) 
Amino Acid 

MDKVCVISNTRERTFKVPADLLCVATEPEISTKEEDAGIEIETRVVVFSRCVSVQELHTI 

NPNDEGFSVQLFKDYLKLQSAQGKKPIGLYIQIKAGEDLERRLISGGTAYLDPATHLFYL 

DFSLYPNYS I FNDISSRLKI I DEDTYNGWFSNSEEKEKDALVLIRVTFSTHEKAIEAAI 

KKIMLRKVFFKDGDLDFGYLRIPKSKLDKFTPYFRSOYGIVNVEKNIPGYIWGEIMKQRV 

RCSRWYLYNTDSEWEYKNVAEERVGPRQLVKKYGAKCENLCFRDIDLRKKEAKEKRDIER 

ETESRYVWTLTHKHEMPENMPYFGPKCSVVRLDETRILLCFVDEISYNDEDVDEILSEN 

RSLRNVSIRHKENVPVHTLLKKGVSIHARFTLNGLDDALIILKRIPKTYFEDEELQAACA 

HVNLEQYEWLCSNNRGNKVEHVKSRVVTRAVKRRRKCRHWIYFDKDTLNLNYKYFDKKVT 

ASMASKICNAKHDCLVFHRKMELEDLTESAY FKVEPSPINFAKLKSCPDVKYVQKKTDGT 

FSVIRFFRNMTKGDLIQRMDLFCRFIPDSHTITLLSRADFYACKRGESMHMCTNKHRILH 

YKFSNAPHAAIEQITNIISDTRGRKGIHIEYAIENVQEMYEEDGRRYEAKYTGTLTEYKR 

NEDKTFKSLLAPHLTPVNKPYNINHLYEQYGNFDEELEDKLRSGFISYDTYVTAKDNWGR 

CATGKGACI 

(SEQ ID NO: 63) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Y061_CAEEL (P34600) HYPOTHETICAL 84.7 KD PROTEIN ZK1098.1 I 36 0.95

Q9ZAJ8 (Q9ZAJ8) BONT PROTEIN 36 1.2

Q01794 (Q01794) MAJOR SURFACE ANTIGEN MSG1 (FRAGMENT) 36 1.2

Q9X708 (Q9X708) BOTULINUM NEUROTOXIN TYPE B (FRAGMENT) 35 1.6

GLND_HAEIN (P43919) [PROTEIN-PII] URIDYLYLTRANSFERASE (EC 2 35 2.1

BXB_CLOBO (P10844) BOTULINUM NEUROTOXIN TYPE B PRECURSOR (E 35 2.1



FIGURE 3, Sheet 37 of 160 



WO 01/38351 PCT/US00/28888 

79/201 



CT1060 

Nucleotide 

Genomic coordinates : 

Start: 230616 

Stop: 231582 (SEQ ID NO: 64) 



Amino Acid 

MCTLKTYKMTTSTEISKNLSDVLSIKATGDWCSNIKTVFSPFTEGKGNLPNSLPFTRSPN 

TTCGSREAANATEHFITVFAKDKYERKRVKRTIGFTLDNTKELTPNRYLVADVYSWQEEK 

MVFEGFCVPPGKSGTFVRYSNEDKSFLLADTGRYMKKKYDDPENKTSSGGDDDDDDDDDD 

DDNNNVDVYEENDPRNVFEVEKDEKYACTFSILVYRAMKKSPPVCRGLLVETDGPSSHPK 

RAPSAFNPFGGSSMLNGYGAGADALEEEDEVDGVPERERITN FALKRGPATGQNFVSVKL 

EHDGSKADLYNVTCFSKQRGV 

(SEQ ID NO: 65) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

O97300 (O97300) PFC1035W PROTEIN 51 1e-05

KEX1_YEAST (P09620) CARBOXYPEPTIDASE KEX1 PRECURSOR (EC 3.4 47 2e-04

Q83970 (Q83970) (CPV) 46 3e-04

SIS2_YEAST (P36024) SIS2 PROTEIN (HALOTOLERANCE PROTEIN HAL 46 3e-04

YB00_YEAST (P38114) PUTATIVE 126.9 KD TRANSCRIPTIONAL REGUL 45 6e-04

O77384 (O77384) PFC0760C PROTEIN 45 8e-04



Comments : 



TaqMan Primer/Probe Sets: 

5'start=477

5'stop=500

3'start=555

3'stop=578

5'primer=CGATGATCCAGAAAATAAGACCAG (residues 477 to 500 of SEQ ID NO: 64)
Tm5=57.85

3'primer=TCGTTTTCTTCATACACGTCAACA (residues 555 to 578 of SEQ ID NO: 64)
Tm3=58.34

probe1=ATGACGATGACGACGATG (residues 515 to 532 of SEQ ID NO: 64)

probe1start=515

probe1Stop=532

direction1=Forward

Tm1=69.04

score1=1.95

length=102
CT1061
Nucleotide 

Genomic coordinates: 
Start: 231602 

Stop: 232799 (SEQ ID NO: 66) 
Amino Acid 

MQLILSHHLTMAGRVELVTGPMFAGKSTYLKNI YQQENGGNKHCLFVKHSLETRYGCGTG 
T I VTHAGEV I EGCTTVSS IKEL I S VLPEVVDVI LI DEGQFFTDLVLVNRLADKGKRI V I A 
ALDGTSDQQMFSPIHKLLPYTNSIVKLASKCMICKIDTKEAPFTVRFGNDNDNNVICVGG 
AEMYAAACRDCYKKINKKKNKGKLVVLEGGDRCGKSTQAKLLLTNKNSPLYGGEYMCFPD 
RSSHTGKLINDYLTKKIELDDHAAHLLFSANRWEVCSKIKQLLDDGIHVVMDRYYYSGIV 
FSLARGVDTVEWCSASDEGLPQPDLVLLMLLDVEKCSNRDTFGVERFETNSIQERARALF 
LDLANKDEKNVWIKVDARGTIEEVQTKIINIVYNIVEE 
(SEQ ID NO: 67)

Top Blast Hits 



Score E

Sequences producing significant alignments: (bits) Value

O74528 (O74528) THYMIDYLATE KINASE 165 6e-40

KTHY_HUMAN (P23919) THYMIDYLATE KINASE (EC 2.7.4.9) (DTMP K 159 3e-38

KTHY_MOUSE (P97930) THYMIDYLATE KINASE (EC 2.7.4.9) (DTMP K 157 1e-37

KTHY_CAEEL (Q22018) PROBABLE THYMIDYLATE KINASE (EC 2.7.4.9 148 8e-35

KTHY_YEAST (P00572) THYMIDYLATE KINASE (EC 2.7.4.9) (DTMP K 143 2e-33

KTHY_SCHPO (P36590) THYMIDYLATE KINASE (EC 2.7.4.9) (DTMP K 141 6e-33
CT1062
Nucleotide 

Genomic coordinates: 
Start: 232843 

Stop: 233334 (SEQ ID NO: 68) 
Amino Acid 

MLPRKTLPDTENGYFVLDESLLEKVYYDNNNELITOVGGIYMQICKSKYIFHHDDPERFF 
YSVLEDYHPIKEIVERLAEEDGVFLGPWEFLSRKQVNLQHGCYKALLSLPEDKYCNLLLP 
QQMKTNLEKMEEIQRTRLIHSRTYNTPQIELSDQLDGCVIC 
(SEQ ID NO: 69) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

O60678 (O60678) PROTEIN ARGININE N-METHYLTRANSFERASE 3 (FRA 31 5.4

P95966 (P95966) ORF C04027 30 9.3

Q9ZW94 (Q9ZW94) F5A8.4 PROTEIN 30 9.3

O58387 (O58387) 358AA LONG HYPOTHETICAL PROTEIN 30 9.3

TaqMan Primer/Probe Sets:

5'start=223

5'stop=245

3'start=309

3'stop=330

5'primer=GAACGACTAGCAGAAGAGGATGG (residues 223 to 245 of SEQ ID NO: 68)
Tm5=58.17

3'primer=TGGCAATGACAAAAGAGCTTTG (residues 309 to 330 of SEQ ID NO: 68)
Tm3=58.99

probe1=AAGTGAACCTCCAACACG (residues 284 to 301 of SEQ ID NO: 68)

probe1start=284

probe1Stop=301

direction1=Forward

Tm1=68.99

score1=1.96

length=108
CT1063

Nucleotide 

Genomic coordinates : 

Start: 236678 

Stop: 238604 (SEQ ID NO: 70)
Amino Acid 

MVASTPCPGPGPVPTQELLSTNFLEAHKLVVELLLPSYSSDVVYCDSETYTKPIPIFGNK 

SIVSTIGDYVLSNPNEDVSYQMVSSVLEKFPLLFHCTYKTNEEDKGIPLWKKLYNKRKFK 

LLNSLLVHNNKNWTPVPAIPFDRENICDASGRSVI^SEIMSTSTFQTICKNNTHYLFDML 

NMERGKQGGSFLHFFASRKNSFTNFENEEMDSHVLSNIAKFICNEKEKLDSFIPANGKIP 

CPDKTNDEGYIPLEIAIMEDNYPALLYLVCRYGASWANTYGDHNESLKAFAIRNDAKDCL 

EIIEFISDHYSFNKNVTKEEFVKEKTVECVGCLYDIEDEKRCYKLPCGHFMHTFCLSNKC 

SKANFRCVKC FQT FDDT I FRKCPPT I QWKMG I NQTTNHKEMDLFNRAFDTYLDFICS YNV 

KLDKKSKPKHKPENKKVEEELAKRTAEIEEAIKKKEEELAKRTAEIEEAIKKKEEELAKR 

TAEIEEAMKKKEEEELSKYNKI IEKGKRRLNEECVKLRDISTAAINMYKEKVRINGVLLK 

DSDQELAEAKERLRKILLLEEETKLDRFLFRPKRVEERIFLTKDDETLAFKLALEKKTED 

IIAKKNNQKGSERRDGEYTITSHIEKLPQSTALASVCVLNE 

(SEQ ID NO: 71)

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

Q26938 (Q26938) KINETOPLAST-ASSOCIATED PROTEIN (KAP) 74 3e-12

Q9ZIU2 (Q9ZIU2) VIRULENT STRAIN ASSOCIATED LIPOPROTEIN 73 5e-12

O50870 (O50870) HYPOTHETICAL 54.3 KD PROTEIN 73 5e-12

Q9ZU69 (Q9ZU69) PUTATIVE VICILIN STORAGE PROTEIN (GLOBULIN- 68 2e-10

O23230 (O23230) TRICHOHYALIN LIKE PROTEIN 62 1e-08

MNN4_YEAST (P36044) MNN4 PROTEIN 62 1e-08

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 362 to 1 of CT1063: this
corresponds to nucleotides 238163 to 238524 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=775

5'stop=797

3'start=850

3'stop=872

5'primer=GAAGACAATTACCCTGCATTGCT (residues 775 to 797 of SEQ ID NO: 70)
Tm5=58.33

3'primer=GCAAACGCTTTGAGAGATTCATT (residues 850 to 872 of SEQ ID NO: 70)
Tm3=58.56

probe1=TAGGTATGGAGCATCTTG (residues 810 to 827 of SEQ ID NO: 70)

probe1start=810

probe1Stop=827

direction1=Forward

Tm1=68.95

score1=1.95

length=98
CT1064

Nucleotide 

Genomic coordinates : 

Start: 238658 

Stop: 239438 (SEQ ID NO: 72) 



Amino Acid 

MSTCSNLLSVFGGGDWTTTFPFDLVHTRQECDKKREQDYSFFITETCKGENIGIHSYEHT 
SKIIDTGNNDSTSIEELEVLNIYKAINHLENILKLNKGEKIIL.MDVETMILETHKILMKG 
ILPKGKNGSFSTCVRFAVNKNNERHYYPVFETEKEAFNSIQNLVDYYNEIVAHTNDQIKI 
IKACAYFMYNFLTLHPFNDGNGRTARLLYSFLLKGNGIVPHFSPITHPRDQFVDTLVYFR 
EHGDGRPLLYVLLES IKNK 
(SEQ ID NO: 73) 



Top Blast Hits 

Score E

Sequences producing significant alignments: (bits) Value

Q23544 (Q23544) ZK593.8 PROTEIN 50 2e-05

Q9ZHQ9 (Q9ZHQ9) HYPOTHETICAL 26.1 KD PROTEIN (FRAGMENT) 44 0.001

O68899 (O68899) HYPOTHETICAL 29.5 KD PROTEIN 43 0.002

Q48249 (Q48249) PLASMID PHPM180, COMPLETE SEQUENCE 43 0.002

O75406 (O75406) HUNTINGTIN INTERACTING PROTEIN HYPE (FRAGME 42 0.004

AAC96089 (AAC96089) HYPOTHETICAL 23.9 KD PROTEIN (FRAGMENT) 42 0.004



Comments : 

EST confirmation of the predicted transcript: 

Nucleotides 1 to 677 of CT1064: this corresponds to nucleotides 238717 to 239393
of the genomic reference sequence.

TaqMan Primer/Probe Sets: 

5'start=397

5'stop=419

3'start=450

3'stop=472

5'primer=TGCGTACGCTTTGCTGTAAATAA (residues 397 to 419 of SEQ ID NO: 72)
Tm5=58.67

3'primer=TGAACGCTTCTTTCTCTGTTTCA (residues 450 to 472 of SEQ ID NO: 72)
Tm3=57.92

probe1=AATGAACGGCATTACTAC (residues 424 to 441 of SEQ ID NO: 72)

probe1start=424

probe1Stop=441

direction1=Forward

Tm1=68.96

score1=1.88

length=76
CT1065

Nucleotide 

Genomic coordinates: 

Start: 240712 

Stop: 241192 (SEQ ID NO: 74) 
Amino Acid 

MEDLKSTIERVYEERVENLEQWTNTVEEEERTVSAIDSVLEEQKRALDAWEAAIKEREND 
LAVKEGISALVFNAADAKTRKELINTWIAERETSEKRRKEATSTNNQLKNQMSSLVNTTK 
TLKEKYNKYYRRSAILNMQYINNKRDYEASQFWVYTNNA 
(SEQ ID NO: 75)

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

RADI_HUMAN (P35241) RADIXIN 43 0.001

RADI_PIG (P26044) RADIXIN (MOESIN B) 43 0.001

O40947 (O40947) ORF 73 42 0.002

SSP5_STRGN (P16952) AGGLUTININ RECEPTOR PRECURSOR 41 0.005

Q9Y489 (Q9Y489) CENTRIOLE ASSOCIATED PROTEIN CEP110 41 0.005

Q21952 (Q21952) SIMILAR TO MYOSIN HEAVY CHAIN 41 0.005

Comments:

TaqMan Primer/Probe Sets:

5'start=228

5'stop=249

3'start=297

3'stop=320

5'primer=CGCCAAAACACGTAAAGAATTG (residues 228 to 249 of SEQ ID NO: 74)
Tm5=58.34

3'primer=TGATTATTGGTAGAGGTTGCTTCC (residues 297 to 320 of SEQ ID NO: 74)
Tm3=57.45

probe1=AATACGTGGATAGCCGAA (residues 253 to 270 of SEQ ID NO: 74)

probe1start=253

probe1Stop=270

direction1=Reverse

Tm1=68.84

score1=1.84

length=93
CT1066
Nucleotide 

Genomic coordinates: 
Start: 241184 

Stop: 241505 (SEQ ID NO: 76)



Amino Acid 

MHKFSNKFYFIIKGVLIIIFVPDVVFSIFLLPPLGVRHKNGGGGNEEQKSGPSQKHHIPG 
PVLIFVLIIVIVGSVVIIIGVLISVRIAVLLWSHPYIHDGQDEDTN 
(SEQ ID NO: 77) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q19507 (Q19507) F17A2.11 PROTEIN 33 0.78 

Q37367 (Q37367) NADH DEHYDROGENASE SUBUNIT 1 (FRAGMENT) 29 8.9

PSC_DROME (P35820) POSTERIOR SEX COMBS PROTEIN 29 8.9 

YG35_YEAST (P53273) HYPOTHETICAL 117.0 KD PROTEIN IN ASN2-P 29 8.9 

AAF05147 (AAF05147) ORF33 29 8.9 



Comments: 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 452 to 1 of CT1066: this
corresponds to nucleotides 241246 to 241697 of the genomic reference sequence.
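
The EST comments in these records pair a transcript range with a genomic range, and the two ranges should cover the same number of nucleotides. The sketch below checks that for the CT1066 comment in Python. It assumes that a range written high-to-low (such as "452 to 1") denotes the same span read in the opposite orientation; the function names are illustrative and are not taken from the source.

```python
def span_length(first: int, last: int) -> int:
    """Number of nucleotides in an inclusive range, regardless of the order of its endpoints."""
    return abs(last - first) + 1


def est_mapping_consistent(transcript: tuple[int, int], genomic: tuple[int, int]) -> bool:
    """True if the transcript range and the genomic range cover the same number of nucleotides."""
    return span_length(*transcript) == span_length(*genomic)


# CT1066: nucleotides 452 to 1 of the transcript, 241246 to 241697 of the genomic reference
print(span_length(452, 1), span_length(241246, 241697))
print(est_mapping_consistent((452, 1), (241246, 241697)))
```

Equal span lengths do not prove the alignment itself, but unequal ones indicate a damaged coordinate.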
CT1067
Nucleotide 

Genomic coordinates: 
Start: 241774 

Stop: 243409 (SEQ ID NO: 78) 
Amino Acid 

MFRQFCSLYLLQRRVNDNLRSTASASAAASLKGDGTEFITGEPPSHKMRGPSYSVLGPDP 
CEDPERVYVDIVVSILQTNNIQVTKEWELFSDKLRKLGPWIDRSGIENNGEGEEDGDENE 
DGGGNGGRIEDREAHRRKMMKKLSFVGREDPVAVDLPTWRENSTEFARRLTLKELCDLIV 
ECGCIKSKEELFDFIFEEPWEIKEAADVRGMANRSKFTKESLIDWFFEFDTYSKCVVFFE 
AVNWYLKSQASPISLVLDDIYCCVFSYIRRQTFLTRAKNPSLTVASSFSPTPDTKLLAID 
ECVQHFLKSDINISQMALTERDCFFPLLTEMPRQQKKVNTFLDTMKRPTLSLLPSTSSSS 
SSNNKRKRNTAAANILLPVYRSNFSTASNNKRLKTDDGENASACILIEGYANGKISPIRI 
MVRKSTIIPEVFNHLLFPVFASKDTGANILFFIKMKSFASASLLLPGLFRHPKQFLNGPC 
KWMTLAENNINDNNINSSTMWSYTLADYCPLGYYTQESPQPYQTCGNFTSTTNKRLQNVQ 
PLYF 

(SEQ ID NO: 79) 
Top Blast Hits 

Score E

Sequences producing significant alignments: (bits) Value

CAB38842 (CAB38842) HYPOTHETICAL 35.2 KD PROTEIN 39 0.061

CAB52581 (CAB52581) CONSERVED HYPOTHETICAL TBC DOMAIN PROTE 39 0.10

O84402 (O84402) RIBONUCLEASE FAMILY 37 0.31

Q59085 (Q59085) RNA POLYMERASE SIGMA-54 FACTOR 37 0.31

O80815 (O80815) T8F5.22 PROTEIN 35 1.2

PI4K_DICDI (P54677) PHOSPHATIDYLINOSITOL 4-KINASE (EC 2.7.1 35 1.6

Comments:

TaqMan Primer/Probe Sets:

5'start=717

5'stop=740

3'start=791

3'stop=814

5'primer=TGAAGCAGTCAACTGGTACTTGAA (residues 717 to 740 of SEQ ID NO: 78)
Tm5=57.74

3'primer=TTTGGCGTCTTATGTAGGAAAAGA (residues 791 to 814 of SEQ ID NO: 78)
Tm3=58.08

probe1=ATCTCAAGCGTCTCCAAT (residues 741 to 758 of SEQ ID NO: 78)

probe1start=741

probe1Stop=758

direction1=Forward

Tm1=68.99

score1=1.99

length=98
CT1068

Nucleotide 

Genomic coordinates: 

Start: 243216 

Stop: 243798 (SEQ ID NO: 80) 



Amino Acid 

MDDSSRKQHQRQQHKLFHDVELHASRLLSSGLLHPREPSTLSDMRQFYFDYKQETTKRAA 
I ILLNTLLEYYRTPSEEWE I P FNLLLN VMNN KWSTLI PGVKI SAGI I SKLPWTMKTMYEI 
VSSPNNNNNNGDYYSTCRRMVMEYPIGGLLHTPAITNKYPRSRMVTCTKGKDHQKLYDIS 
RQMFDI IEANGQL 
(SEQ ID NO: 81) 

Top Blast Hits 

Score E

Sequences producing significant alignments: (bits) Value

P3K3_DICDI (P54675) PHOSPHATIDYLINOSITOL 3-KINASE 3 (EC 2.7 32 3.0

O01590 (O01590) K09H11.1 PROTEIN 31 6.7

RRPL_DUGBV (Q66431) RNA-DIRECTED RNA POLYMERASE (EC 2.7.7.4 31 8.8



Comments : 



TaqMan Primer/Probe Sets: 

5'start=234

5'stop=257

3'start=308

3'stop=331

5'primer=GGAAATTCCGTTTAATCTCTTGCT (residues 234 to 257 of SEQ ID NO: 80)
Tm5=57.99

3'primer=GGAGTTTCGATATGATACCTGCAC (residues 308 to 331 of SEQ ID NO: 80)
Tm3=57.75

probe1=GAGTACACTCATTCCAGG (residues 279 to 296 of SEQ ID NO: 80)

probe1start=279

probe1Stop=296

direction1=Reverse

Tm1=68.82

score1=1.82

length=98
CT1069

Nucleotide 

Genomic coordinates: 

Start: 244241 

Stop: 244856 (SEQ ID NO: 82)
Amino Acid 

MDLSFTLSVVSAILAITAVIAVFIVIFRYHNTVTKTIETHTDNIETNMDENLRIPVTAEV 
GSGYFKMTDVSFDSDTLGKIKIRNGKSDAQMKEEDADLVITPVEGRALEVTVGQNLTFEG 
TFKVWNNTSRKINITGMQMVPKINPSKAFVGSSNTSSFTPVSI DEDEVGTFVCGTTFGAP 
IAATAGGNLFDMYVHVTYSGTETE 
(SEQ ID NO: 83) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

O85179 (O85179) FLAGELLIN A 34 0.60

FLAA_CAMJE (P22251) FLAGELLIN A 34 0.60

BAA83944 (BAA83944) UNKNOWN 34 1.0

FLAB_CAMJE (P22252) FLAGELLIN B 33 1.8

AAC25644 (AAC25644) FLAGELLIN A 33 1.8

AAC25648 (AAC25648) FLAGELLIN A 33 1.8

Comments : 

EST confirmation of the predicted transcript and hits to public SBV sequences:
Nucleotides 1 to 803 of CT1069: this corresponds to nucleotides 244132 to 244934
of the genomic reference sequence.

Hit to public sequence gi|6856162|gb|AF173993.1: CT nucleotides 1 to 615 match
nucleotides 323 to 937 of the public sequence with 100% homology, a score of
1219 and an E value of 0.
CT500

Nucleotide 

Genomic coordinates: 
Start: 2425 

Stop: 1537 (SEQ ID NO: 84) 



Amino Acid 

MKNSRQRSGVWRGNSCLYKSFYFSGAIIECKKIRIIMMFLLLSLILFVCFVGVVVGVIFM 
SRPNKTTTTSNKKTKKDKEKEKEDDTEGAVLGRREPENRPIGRDEEGAVEDGKEEEEVFE 
FEQPSVNTGSNTGGGGTGTVPGEGLLPPPPPTPTPTPPPTPTPTPPPPPTRTPSPSSSLG 
EDDDDDIDIDFDDNDIDEFLDSGEEMEEDEEEEDLDTLLSRLETGMSGEEVDFDASSAYI 
QPDPVVVKNIERSDYTLDPMESWKVLNRSEGDIRFFVDRGITNKIKAMTEDLKEL 
(SEQ ID NO: 85) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q48373 (Q48373) CHITINASE PRECURSOR 64 2e-09

O86476 (O86476) CLUMPING FACTOR B PRECURSOR 59 4e-08

O92451 (O92451) ACMNPV ORF91 58 9e-08

Y091_NPVOP (O10341) HYPOTHETICAL 29.3 KD PROTEIN (ORF92) 58 9e-08

Q69023 (Q69023) (B95-8 ISOLATE) U2-IR2 DOMAIN ENCODING NUCL 57 2e-07

Q42421 (Q42421) CHITINASE PRECURSOR 57 2e-07



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 244 to 513 of CT500: this
corresponds to nucleotides 1979 to 2248 of the genomic reference sequence.

TaqMan Primer/Probe Sets: 

5'start=419

5'stop=440

3'start=505

3'stop=523

5'primer=TGCCTGGAGAAGGTTTGTTACC (residues 419 to 440 of SEQ ID NO: 84)
Tm5=59.11

3'primer=GAGATGGGGTTCGTGTCGG (residues 505 to 523 of SEQ ID NO: 84)
Tm3=59.88

probe1=CCTCCTACTCCTACTCCT (residues 448 to 465 of SEQ ID NO: 84)

probe1start=448

probe1Stop=465

direction1=Forward

Tm1=69.04

score1=1.95

length=105
CT501

Nucleotide 
Genomic coordinates: 
Start: 7645 

Stop: 7042 (SEQ ID NO: 86) 
Amino Acid 

MTMWNKIVITTKRMNWPMVVGVFFILAITALAVALYIRHASKQEKYSTSHINEQFTAKQL 
PVTYLSKTGKLKDMHLTHSDFMAYVDVHNRTKTLKHPMCTDEAGWAHFCLLASAEAYRRI 
RYGRGEFGPEKHSLAETIQSTVQDMSEPYITHIFKKNTDVDGHGMQSVLEKNRNKIRMGD 
GKTSSETYNLSDKSISIVGV 
(SEQ ID NO: 87) 

Top Blast Hits 

Score E

Sequences producing significant alignments: (bits) Value

Q21859 (Q21859) R09D1.3 PROTEIN 34 1.0

Q9Y3S0 (Q9Y3S0) EMDC II PROTEIN 32 4.0

AAD25099 (AAD25099) METALLOPROTEASE DISINTEGRIN CYSTEINE-RI 32 4.0

O31548 (O31548) YFJL PROTEIN (RIBOSOMAL PROTEIN L6-LIKE PRO 31 5.3

Q44602 (Q44602) PHOSPHORIBOSYL ANTHRANILATE TRANSFERASE 31 6.9

CAB52230 (CAB52230) HYPOTHETICAL 33.8 KD PROTEIN 31 6.9

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 587 to 1 of CT501: this
corresponds to nucleotides 6996 to 7582 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=245

5'stop=266

3'start=315

3'stop=332

5'primer=TGGCATATGTTGATGTGCACAA (residues 245 to 266 of SEQ ID NO: 86)
Tm5=59.44

3'primer=AGCAGGCAAAAGTGGGCC (residues 315 to 332 of SEQ ID NO: 86)
Tm3=59.78

probe1=ATGTGTACTGACGAGGCT (residues 292 to 309 of SEQ ID NO: 86)

probe1start=292

probe1Stop=309

direction1=Reverse

Tm1=68.96

score1=1.96

length=88
CT502

Nucleotide 

Genomic coordinates: 

Start: 8502 

Stop: 7641 (SEQ ID NO: 88) 
Amino Acid 

MSSGSINNHPSSNMDTNKMEEGEEQDFDVLELDYSKIIHDITAMLSVAAPPPNSILDASD 

GLIATASATAPAAETGNSNRMRLDKDVCQLIERDIELVKSDTIEVDSIIRQLLYFGESAS 

EKNIKTNSTEKEPVYFPKEPKGEAVKLAKNTPVLDTITKLDWMANICQSNKIGVENLASA 

LQSGQLIWTTFPAAVYASLDS FYHIAIMWKLLGSFINIEALSKGSKDNLLPRDDIQVVHA 

KQEIAAMLQSRQNILGRGPSEYPPVPITAILSRTIIPLLRNFSEKL 

(SEQ ID NO: 89) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

O43631 (O43631) SPINDLE POLE BODY PROTEIN SPC98 HOMOLOG 38 0.063

O60853 (O60853) PROTEIN ENCODED BY SACCHAROMYCES CEREVISIAE 38 0.063

O60852 (O60852) PROTEIN ENCODED BY SACCHAROMYCES CEREVISIAE 38 0.063

AMPR_HUMAN (P15514) AMPHIREGULIN PRECURSOR (AR) (COLORECTUM 34 1.2

BYN_DROME (P55965) T-RELATED PROTEIN (TRP) (BRACHYENTERON P 33 2.1

AAB32396 (AAB32396) T-RELATED PROTEIN 33 2.1

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 753 of CT502: this
corresponds to nucleotides 7704 to 8456 of the genomic reference sequence.

TaqMan Primer/Probe Sets: 

5'start=315

5'stop=336

3'start=392

3'stop=412

5'primer=TGACTCCATTATTCGCCAACTG (residues 315 to 336 of SEQ ID NO: 88)
Tm5=58.70

3'primer=TGGGGAAGTAAACTGGCTCCT (residues 392 to 412 of SEQ ID NO: 88)
Tm3=58.70

probe1=TGGAGAATCTGCATCAGA (residues 345 to 362 of SEQ ID NO: 88)

probe1start=345

probe1Stop=362

direction1=Forward

Tm1=68.85

score1=1.85

length=98
CT503

Nucleotide 

Genomic coordinates : 
Start: 9248

Stop: 8552 (SEQ ID NO: 90) 
Amino Acid 

MDPGASAASRRALWSSTVTNTRHYQQQLNRALNKIEEEDDVEEEHGQVTTTNKEMASTST 
SSSSSSSSSPTSSAI PSSDEEEEEEEEYDSESDTNVDSLLGEEEEEDS DTESTSADANFL 
RSSSRNSTTRNRLIKKYVDRFIKYEKDILIADRNKRKKRHRNRQPQIHKLNNKRLKKPTD 
KKQKTNKKKTWRRLPKFIKKMSPASRLKFFSACI ISGIKITSIIVLSIMAL 
(SEQ ID NO: 91)

Top Blast Hits 



Score E

Sequences producing significant alignments: (bits) Value

NSR1_YEAST (P27476) NUCLEAR LOCALIZATION SEQUENCE BINDING P 56 2e-07

FKB4_YEAST (Q06205) FK506-BINDING PROTEIN 4 (PEPTIDYL-PROLY 55 4e-07

GAR2_SCHPO (P41891) GAR2 PROTEIN 52 5e-06

O95367 (O95367) CBF1 INTERACTING COREPRESSOR CIR 52 5e-06

SR40_YEAST (P32583) SUPPRESSOR PROTEIN SRP40 51 6e-06

O08904 (O08904) BRAIN X-LINKED PROTEIN (BRX PROTEIN) (FRAGM 50 1e-05
CT504

Nucleotide 

Genomic coordinates: 

Start: 13936 

Stop: 9328 (SEQ ID NO: 92) 
Amino Acid 

MAHKLLFXEEEDAKEIGTLSHPEPSFALYESETFRSVGFCKNVTDAYPKFLPRPMDINSV 
QALAVRLALIQFYKGRGWKKNMSIIDLVKDKVERNFKVDKKTSGGFIIGDGTGVGKTREL 
AAFVMSVILQEKALLDVQKHVGPSIFGQDSDKVITAINSGVWKRHPFFIWLTCSKPLFNS 
CQQGMREVVTNSRGLRDPKFSWRKLQVPCANKPTSFKSDGKSGSMTVDVENSVSSAKDSV 
DIRFFTLRDVKEFHSKRSSRSIGDFLTETPTILFMTYSDLRTNLEFVLKFITGGTDLDSN 
KVMPIDNFVTALLCDEFHKTQNISDS FRKELAKTWEEEDTRVLRNIQKRANPSVSDLINR 
FKSAMSDDRNFKVKRMKSSNNKGRVTMSNYLKLLSQADAFRIFLEILKYDTFTVMASATP 
FQSNADLHMIDHILRKSAPAYTSIQAFKEVSSATPDAMAEHSEYVTVFLEQVIKLLRNRG 
QLVS RSI SMAGVDCSTTNCKAS PLQKYAI DELASYCLNARQVL I DSEKVGGH VRRAFTKI 
IREHQEGGILEEEDVEKLVAEINSPSRKRKRAANDDDLYEVMENIDRRFKWVVRDRDVA 
HDGKTTLRSIVQDAIKTYSQKKDALSNGGGGIITSPEVDISSIDMVAQDLYDAIKKKEKP 
SKGKTDFNEDYDDGANEEDGWGEVFDDECFEKLRRQYFINTASTSVAACKGALLNIKATS 
VTDAVKRLRTTNESKKMVMSLEQTGDSFLKNLTTRILQTIAKDESDAKYGIVDVGTFDSS 
PVANTIFSGYRLLCRAVMMASAFTISLKNKTNRRTSPAHVMLVPSVPDTEPLMALAGNPI 
DSITQSIGEDSNAEITNRKLCSRITNRGLFLVKNNTKTANTNKCISAFNNTKEVDVIMLG 
PKGNTGLSLHDSSNNSMYAKRYHCVLDVPYNAIAFLQTIGRTHRNGQLSVPQFLIFSTDS 
PAERRFFDSLDKRIKDSKAGTYADRYSNNSIDIAAAVMREQFIDQGLVLKTMGNIVQIVT 

asmtkvhlmehfskmmmrtnrggvafvegltlengiftevivlamhialwigaqnkits 
sddlghalsftsvlphnqilsivksasqfvfsnlclhlvhfksdcdnllprekrvrdaa^ 
alidtlntknnevtsktnkiesdapsltalmlpsgprnrkmdvfsnimaynnnngmdfde 
dvpdndedegclplqeenattlalsnfphdydraikdahqlvtvrivgqgekegvipise 
cldvpeldmtnlipvvtatnviqslakenpgllftihnaalahshregyggshllglakk 
lsrgfinfrqfqnqlfspkkeskimydiflsvkaimarddrydglc dmrmnsmmdasflk 
vrkkpecvfitklldknfrrhi indeeeetrerfggeeeeedddeefedeeeeqaerewg 
eeegesaydisvindknntighdvdi ilcnrkkltltkensvfvnehi dsfmvgnligae 
gsliqicfdnctgefeglpkfclydssskdkdtip 

(SEQ ID NO: 93)
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

O01940 (O01940) STRAWBERRY NOTCH (SNO) 72 3e-11

Q9Y2G9 (Q9Y2G9) KIAA0963 PROTEIN 62 3e-08

O75257 (O75257) R31180_1 62 3e-08

O01737 (O01737) F20H11.2 PROTEIN 62 4e-08

O85862 (O85862) PROBABLY METHYLASE/HELICASE 59 2e-07

O64516 (O64516) YUP8H12R.3 PROTEIN 53 1e-05

TaqMan Primer/Probe Sets: 

5'start=2131

5'stop=2154

3'start=2210

3'stop=2233

5'primer=GGTGCACTATTGAATATCAAGGCA (residues 2131 to 2154 of SEQ ID NO: 92)
Tm5=59.03

3'primer=CAGTTTGCTCTAGTGACATGACCA (residues 2210 to 2233 of SEQ ID NO: 92)
Tm3=58.06

probe1=TTCTGTCACCGATGCAGT (residues 2157 to 2174 of SEQ ID NO: 92)

probe1start=2157

probe1Stop=2174

direction1=Forward

Tm1=69.00

score1=1.99

length=103 
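
The fields of a primer/probe set are related by simple arithmetic: in this record the reported length equals the 3' stop minus the 5' start, plus one (2233 - 2131 + 1 = 103). The following is a minimal Python sketch of that check, not part of the original listing. The helper names `parse_taqman` and `amplicon_length` are hypothetical, and the length relationship is an inference from the listed values rather than a stated rule.

```python
# Illustrative sketch only: helper names are hypothetical, and the rule
# "length = 3'stop - 5'start + 1" is inferred from the listed values.
EXAMPLE = """\
5'start=2131
5'stop=2154
3'start=2210
3'stop=2233
3'primer=CAGTTTGCTCTAGTGACATGACCA (residues 2210 to 2233 of SEQ ID NO: 92)
length=103
"""


def parse_taqman(block):
    """Turn 'key=value' lines into a dict, dropping '(residues ...)' notes."""
    record = {}
    for line in block.splitlines():
        if "=" not in line:
            continue  # skip blank lines and headings
        key, value = line.split("=", 1)
        record[key.strip()] = value.split("(", 1)[0].strip()
    return record


def amplicon_length(record):
    """Inclusive span from the 5' primer start to the 3' primer stop."""
    return int(record["3'stop"]) - int(record["5'start"]) + 1


record = parse_taqman(EXAMPLE)
print(amplicon_length(record), record["length"])
```

The same check can be run on any other primer/probe set in the listing once its OCR damage (stray spaces, `*` for `'`, `l` for `1`) has been repaired.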
CT505

Nucleotide 

Genomic coordinates: 

Start: 16983 

Stop: 14064 (SEQ ID NO: 94) 



Amino Acid 

MDERRRDPLLYPTNRSMRFTAQITLFVTVFVLGCIFALVCAAMAYNVAKPMSVNFQAIHE 

LGMKSKLKAVQGANPEKTLEEYLEARGRHDGVEDASNYPPHPALLDMMNLTVKGNKWNVP 

SETKERNSRFESHDLAANRSSSLLVPEHHIDRLSEATIEKSNKYLDAVSGKKFRQRMVNL 

KDNIEKDDTELYDSLFGVHDIHHHSASGVSGDAPPPPPSTSEGHDEDVDILAYNTGGYCS 

NPVPLKEGQTCTSVCYTSRAVRVMTPFVAGGTFITHKSGEDPKPYCWSGNVPGDHIETSP 

TTGERVVKECSVHTSIVVLTDDGGWQCRPKYPTYFGGSGGTSMTACAFNPSTHKGPPPPS 

SSTPIYYDVLKKQQIRNHTEFRNSSYISKLRQSSSLAEFKIKCNDPEFLYKNPITCFCNN 

KKDVLNNDLLSQDVTKDMKFRGMYECMENPCVMMPNIDPSFVTFDVSTMKCVPGVNNPQD 

SNRHAI I GDDRT PLVGTVPAMG I FLADQSKRG DQIHQQRPKS S I DETTAKKI ALAQAP 1 1 

TPLNLDATNTSKNVLFVPIFSTVLPPLENIPHVIIRPSSLLHRSCLAPVLNKPSSGQHRP 

FCTAPFYIEPAANVLAGNIPQKPYEHSMLATECLRNSRMVSGSVHGGSELLFSTLLSQNK 

PSSYIRTPPGGTPAPEYNSTGDQRLEEIRDFFERNFNDERRLSQTEYVIKKHARGMRTSE 

I YLKS S S WDS LMKRKE FLRH 1 1 KKSEDT FVLKEGLLM RS YGP YAATVLARDM FDLDYLKG 

KPASKTSSTLKVSNPLQYAFPTSYSVLPEEGATDDIFSVDHNRIFDSETIPSYFDCSNVT 

PGSEKLFGTSSSSSEYRVDIDDDAWGLQSFRLDHNPKSGPVVQSDPRLAFDASNISSTPE 

GATITPLSLFKKSLVEWGHKKADVQETSWFRDGVDTSEAYRRLLVETSMAVRNSWFSLAW 

ENKNYYFAKNSS 

(SEQ ID NO: 95) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

CUP5_GALME (Q24998) PUPAL CUTICLE PROTEIN PCP52 PRECURSOR ( 46 9e-04 

CFHD_HUMAN (Q02985) COMPLEMENT FACTOR H-LIKE PROTEIN DOWN16 38 0.33 

P89911 (P89911) PROTEIN KINASE 37 0.57 

CAB39619 (CAB39619) AIG1-LIKE PROTEIN 37 0.57 

CAB53064 (CAB53064) DJ15D12.2 (FHR-3 £ FACTOR H-RELATED PROT 37 0.75 

Q24552 (Q24552) TF125 PROTEIN 36 0.98 
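
Each row of a "Top Blast Hits" table has the same shape: an entry name, an accession in parentheses, a description (often cut off at the column edge), a bit score, and an E value. A minimal Python sketch for splitting such a row follows. The function name `parse_hit` is hypothetical, and the sketch assumes the score and E value are the last two whitespace-separated tokens, which fails on rows where the OCR has split a number (for example `3 . 8`).

```python
# Illustrative sketch only: parse_hit is a hypothetical helper, not part of
# the listing. Assumes "ENTRY (ACCESSION) DESCRIPTION SCORE EVALUE".
def parse_hit(row):
    # The last two whitespace-separated tokens are the bit score and E value.
    head, score, evalue = row.rsplit(None, 2)
    entry, rest = head.split(None, 1)
    close = rest.index(")")          # end of the parenthesised accession
    accession = rest[1:close]
    description = rest[close + 1:].strip()
    return entry, accession, description, int(score), float(evalue)


hit = parse_hit("Q24552 (Q24552) TF125 PROTEIN 36 0.98")
truncated = parse_hit(
    "CUP5_GALME (Q24998) PUPAL CUTICLE PROTEIN PCP52 PRECURSOR ( 46 9e-04"
)
print(hit)
print(truncated)
```

Truncated descriptions are kept as they appear; the sketch does not try to complete them.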

TaqMan Primer/Probe Sets: 

5'start=1446

5'stop=1466

3'start=1511

3'stop=1532

5'primer=CAGGCATGCAATAATAGGCGA (residues 1446 to 1466 of SEQ ID NO: 94)
Tm5=59.66

3'primer=CTTTTAGATTGGTCGGCCAAGA (residues 1511 to 1532 of SEQ ID NO: 94)
Tm3=59.21

probe1=GACAGGACACCGTTAGTG (residues 1468 to 1485 of SEQ ID NO: 94)

probe1start=1468

probe1Stop=1485

direction1=Reverse

Tm1=69.00

score1=1.99

length=87 
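
In this set the probe (1468 to 1485) lies strictly between the end of the 5' primer (1466) and the start of the 3' primer (1511). A short Python sketch of that positional check is given below as an illustration. The function name `probe_between_primers` is hypothetical, and treating "strictly between, no overlap" as the criterion is an assumption based on the listed coordinates.

```python
# Illustrative sketch only: the function name is hypothetical and the
# non-overlap criterion is an assumption drawn from the listed values.
def probe_between_primers(five_stop, three_start, probe_start, probe_stop):
    """True when the probe sits between the two primers without overlap."""
    return five_stop < probe_start <= probe_stop < three_start


# Values from the CT505 primer/probe set.
ct505_ok = probe_between_primers(1466, 1511, 1468, 1485)
# A made-up probe that overlaps the 5' primer, for contrast.
overlapping = probe_between_primers(1466, 1511, 1460, 1477)
print(ct505_ok, overlapping)
```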
CT506

Nucleotide

Genomic coordinates:

Start: 25878 

Stop: 25197 (SEQ ID NO: 96) 



Amino Acid 

MASVFEDPADLFANMDLTGKVPTRPNILFFEGLLPNSGKEIMENRLIHKGKCGAFEADTQ 
LAYFFPSNNEENTKKLNIGFQIKSNCLSFFIRDFLNDWLEEIKDCGPYCTFSQYMDGDKE 
IFGNSVFGQDFTIVAMDWIDKGVTFYIFVDGSDSMENMASLWMCDKLKRMNANVVKVFVD 
NASKPKFSVCKTCRWEFPGPVSYVIEGHGMGHSDLSCDEISEFLVQ 
(SEQ ID NO: 97) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

AMY_BACCI (P08137) ALPHA-AMYLASE PRECURSOR (EC 3.2.1.1) (1, 35 0.44

Q85449 (Q85449) PROTEIN 8 32 3.8

Q85439 (Q85439) COAT PROTEIN 32 3.8 

VP8_RDV (P17379) OUTER CAPSID PROTEIN P8 (STRUCTURAL PROTEI 32 3.8 

Q85451 (Q85451) OUTER CAPSID PROTEIN 32 3.8 

Q24284 (Q24284) PLC-GAMMA D 31 6.6 



Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 652 of CT506: this
corresponds to nucleotides 25216 to 25867 of the genomic reference sequence. 
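
An EST confirmation pairs a transcript interval with a genomic interval, so the two should cover the same number of nucleotides (here 652 on each side). The Python sketch below illustrates that consistency check. The names `span` and `spans_agree` are hypothetical, and counting both endpoints as inclusive is an assumption that happens to fit this record.

```python
# Illustrative sketch only: helper names are hypothetical; both interval
# endpoints are assumed to be inclusive.
def span(first, last):
    """Number of positions in an inclusive interval, in either orientation."""
    return abs(last - first) + 1


def spans_agree(transcript, genomic):
    """True when the transcript and genomic intervals have equal length."""
    return span(*transcript) == span(*genomic)


# CT506: nucleotides 1 to 652 versus genomic 25216 to 25867.
ct506_ok = spans_agree((1, 652), (25216, 25867))
print(span(1, 652), span(25216, 25867), ct506_ok)
```

Using `abs` means an interval written high-to-low is measured the same way as one written low-to-high.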

TaqMan Primer/Probe Sets: 

5'start=287

5'stop=307

3'start=351

3'stop=373

5'primer=ACGACTGGTTGGAGGAGATCA (residues 287 to 307 of SEQ ID NO: 96)
Tm5=58.18

3'primer=TGTTTCCGAAGATTTCTTTGTCC (residues 351 to 373 of SEQ ID NO: 96)
Tm3=58.28

probe1=AGGACTGTGGACCATACT (residues 308 to 325 of SEQ ID NO: 96)

probe1start=308

probe1Stop=325

direction1=Forward

Tm1=68.99

score1=1.99

length=87 
CT507

Nucleotide

Genomic coordinates:

Start: 29077

Stop: 28330 (SEQ ID NO: 98)
Amino Acid 

MIAIANHKHDVSDALVGAHGAKINMLYGKSSTLSVTEAALLMFNDTALTQFAQRGYEPSI 
PTILKAALDFSLQEEEPLVAATGLDVNKAPRSWPILNCRLGYIASSNYPWAEHIISGDKE 
EIKRALEEHEKNANVRFDSDNCPVCLE DFSSTNIIRTTRCGHCIDEKCWDRLVLSTQRGE 
ITRCPVCRERTSLRPDADQVKEMLVEPIVSCKRMAVPDEQVSCKRRRIGYNRYQFLINDV 
WTDESETV 
(SEQ ID NO: 99) 

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

Q9ZT52 (Q9ZT52) RING-H2 FINGER PROTEIN RHA1A 45 7e-04

CAB51420 (CAB51420) RING-H2 FINGER PROTEIN RHA1B 44 0.001

Q9ZT51 (Q9ZT51) RING-H2 FINGER PROTEIN RHA1B 44 0.001

CAB51421 (CAB51421) RING-H2 FINGER PROTEIN RHA1A-LIKE PROTE 44 0.001

017099 (017099) F42G2.5 PROTEIN 42 0.005

016682 (016682) ZK1240.2 PROTEIN 41 0.008



Comments:



TaqMan Primer/Probe Sets: 

5'start=257

5'stop=279

3'start=308

3'stop=330

5'primer=TCAATAAAGCACCTCGTTCTTGG (residues 257 to 279 of SEQ ID NO: 98)
Tm5=59.26

3'primer=CCAAGGATAATTTGAGGATGCAA (residues 308 to 330 of SEQ ID NO: 98)
Tm3=58.96

probe1=CAATACTGAATTGTCGCT (residues 281 to 298 of SEQ ID NO: 98)

probe1start=281

probe1Stop=298

direction1=Reverse

Tm1=68.81

score1=1.81

length=74 
CT508

Nucleotide 

Genomic coordinates: 

Start: 30861 

Stop: 29073 (SEQ ID NO: 100) 



Amino Acid 

MAEAAPRYRQVLEEVLENIEPYMSFLDVETERELALLNDIITSRNSPPVPSSSFKKLDNK 

EEFRDI I YFFINNNTKS DSSPICEGMTFINALTTVCKTFRGLYENIHDDFLFVKYSLLVS 

MDNGFLRRETHGIKFGTGDDSRTGFKFTSKEQAEEEREKVMRRIKKLDGVLASLKKSTSS 

ARSGIVFYVEKCSSVIRFRLFSRIVNITSDYVAEMKKSAPLEPFDISFGYKYFVDESPCV 

TKAKRLISNGNFIIGRPFSCLETSPSSVSTDFREEMNMDARSIARLNWTNEERASAYRSV 

IIKSFLSSIEEEMVEEYCETTTKTVAEMAVEFVDVFIEKAETIQHFQTLYSIFDTMPKFS 

AEMMDNILKNVAINEAVGSGLCGAILLWMINSRPFEEIDYNYFKICLREIMVRKKTDKLC 

DNLIVKRIVSHKNWITDPHEVKGYVRLCVKVSCFMEDLEAFLTKNPWLKHTYFDEKGNT 

LLCYCIINKYSHTSKLVKQEKLNILKPSAKGMSPLMVCAAISSPFTTRVGIEILTTNSLA 

FSFINENNENVFHAAAVATSCNFLDALAKKYKNI IYDFDRSIVNARRRAMVQRP 

(SEQ ID NO: 101) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

CUL1_CAEEL (Q17389) CUL-1 PROTEIN (LIN-19 PROTEIN) 36 0.99 

023296 (023296) HYPOTHETICAL 74.2 KD PROTEIN 35 1.7 

Q57577 (Q57577) CARBON MONOXIDE DEHYDROGENASE CORRINOID/IRO 35 1.7 

GIDA_COXBU (P94613) GLUCOSE INHIBITED DIVISION PROTEIN A 34 3.8

Q47427 (Q47427) PLASMID P15B GENES R, S, SC, SVMl , SVMR, SV 34 3.8 

Q39068 (Q39068) CYCLIN 2A PROTEIN 33 5.0 



Comments:

TaqMan Primer/Probe Sets:

5'start=742

5'stop=764

3'start=822

3'stop=845

5'primer=TCCAATGGCAATTTTATTATCGG (residues 742 to 764 of SEQ ID NO: 100)
Tm5=58.75

3'primer=GATCTAGCGTCCATGTTCATTTCT (residues 822 to 845 of SEQ ID NO: 100)
Tm3=57.51

probe1=CATCCGTGTCAACTGACT (residues 797 to 814 of SEQ ID NO: 100)

probe1start=797

probe1Stop=814

direction1=Reverse

Tm1=68.91

score1=1.91

length=104
CT509

Nucleotide 

Genomic coordinates: 

Start: 37245 

Stop: 36048 (SEQ ID NO: 102) 



Amino Acid 

MAGTDIISSSSSGSSSSKKGGCIVSKKGKTIKGKNIVFKTSIKTSSSSEMMKKHKKRMEI 

KDMVKKCASCKKVDYSSSTLENDALRASIESTCSALNRFPEIKYGEGEIGDVLSAIRI^A 

GCLLAKNEKSFYKFFLRGFQFDKNGFMMLSEGMKRIEKMHTKIAKKVFGGCKAAPLKEDR 

EGKIPCQEFHKPSSYKGEYTTPLPPTPAPVKVLPPLLPYKNVKNKPVFVPDLAVGEAKKP 

CWVHKLFSDDPEERKRLFERHQAGRRDALMEDYGVIPNNDNEAEDTERFVSNALEYQAQM 

LELLDTANMPPPASTPVRRGRTRIVRDYDASPVPSPYSSPLHTPFDAPNVNLNPGSGRMV 

DRVRDGRRNTSRRTSAVMARRINQLQHQFLYYSSDSDF 

(SEQ ID NO: 103) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

001693 (001693) COSMID T08B2 39 0.073

CAB52863 (CAB52863) PUTATIVE MEMBRANE PROTEIN 36 0.37 

001348 (001348) ZINC-FINGER PROTEIN KLU (KLUMPFUSS PROTEIN) 36 0.48 

Q9ZDN6 (Q9ZDN6) VIRB10 PROTEIN (VIRB10) 35 0.83 

Q9Y2W9 (Q9Y2W9) ENDOCRINE REGULATOR 35 1.1 

GAR2_SCHPO (P41891) GAR2 PROTEIN 35 1.1

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 973 of CT509: this
corresponds to nucleotides 36148 to 37119 of the genomic reference sequence. 
CT1070
Nucleotide 

Genomic coordinates: 
Start: 249425 

Stop: 253221 (SEQ ID NO: 104) 



Amino Acid 

MASSGGFFTGIDDLFKTVIQQEKQEKNKPTQAPETEPKPGPSQAPDPVPDPVPKTPTNFC 
PPPPNPLPPPPPPPPKPSREERLKTSKIRLNKALSDIVEATNERVDALKENQALNTEYDK 
KDNYFQVLKCSITPSVPTAIIGAHVKQVAKSSEIEIJWNELDIKNKCSLVYNENESLKFF 
RDHENLILQIAVQLFSRHDNTKCVGAEICVKGNEKNKFVNKLVVKKLPNAPSSSSTVLEI 
RGATRNLLENNFNKGENNTVNENKDIPPSERANLDTTKAEISHVFSTLHRLDTKRKLFFK 
GNTFYQRKPTFDNKFRWTEVIGWTESEASKQTTKSLDKPTDDNLFVLPHSFNNLADHLRL 
KFKNVLYKNSTAHPGKRNYYKTQETLINPQIDSAKEYKMVFAEIDKCLDVLLAIGKNDKY 
TKSTVIQYRGKFRRYLIFCYAFYALNKAKHSRAVSPLPFNFFNLFSFMYCHGPFLHSASF 
LSTLTFVYQHMFFPMGTAAPSVSAKRLMDIDSALMKGGKGVGVRDFGSPSKTSLHTRTLV 
SFLGFAEMAMGTMTALLSGVEVRVSPALQQRISKSLERWCDSVIFIYFTFVLFHRFSGAK 
KVSLESALRLIMGQTHAHTNKVRAAKRCRIEAAEMEGVEEEEAGLTLSYAHLLGLPYSIQ 
KALGLPVPKINPI^TASSSQYNLGDFVGVEQLLKAKREFPAEGETAGFLGMFDNLVKDSI 
DKYYGEGAFSDVVENVKQGMEQNTPYDTSSALMTPIPKAFYEEEKDVPQQEENSTQQRYR 
LNRDVEEYLMAS PMKMVFVS I LDKTNQKERFMSVGDI ALLAVWCKRNVLKKDWNEYAI AK 
GNYEWLGAKMCNHLLLADLVNFGILGDLKITNKLDTNTDTFHRDSDRLPSVADQKKFIKN 
TSLSDRKQLALVHSCVNVSTRTHVGRVTATSWAVDALRTYTRGDKDMFAALSSSLDMYHL 
GHTNSANFVPYFSRNYLCNEQENGLWGYTRRTSEKLAKEELGRGRLGGLNKVGVAKTELA 
AAAIAISSALDMGEVEAVMDDSSKVRKIASTCLNVNAAKVSAAREKAREASIKRLLLATN 
APAAGSSRNSNRFLLKDLWGFFSDPDKRQKLIKGEAVSVLCPNTGFLHAAVPDFVIEYSF 
ESETSIVRLRLRLIKPEKQDEMVCPSTAPEANKKRKLVRNNQDAVLTLDDEDNIVKYNKY 
DMVEDEEARERLRHQDKQSVIAARISKVCERKNPKKKRRLEDPELQSVDEQLIRELAAIA 
Y 

(SEQ ID NO: 105) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

SSGP_VOLCA (P21997) SULFATED SURFACE GLYCOPROTEIN 185 ( SSG 64 8e-09 

YPRO_OWEFU (P21260) HYPOTHETICAL PROLINE-RICH PROTEIN (FRAG 62 2e-08 

013305 (013305) PROTEASE 1 61 5e-08 

P93797 (P93797) PHEROPHORIN-S PRECURSOR 61 5e-08

AAD37432 (AAD37432) M-LIKE PROTEIN PRECURSOR 59 2e-07 

Q54071 (Q54071) M PROTEIN PRECURSOR, MSZW60 59 2e-07 

Comments:

TaqMan Primer/Probe Sets:

5'start=1884

5'stop=1904

3'start=1931

3'stop=1952

5'primer=CCGAATAGAAGCAGCGGAAAT (residues 1884 to 1904 of SEQ ID NO: 104)
Tm5=59.32

3'primer=TGGGCATAAGAGAGTGTCAGGC (residues 1931 to 1952 of SEQ ID NO: 104)
Tm3=60.21

probe1=GAAGGTGTGGAAGAAGAA (residues 1906 to 1923 of SEQ ID NO: 104)

probe1start=1906

probe1Stop=1923

direction1=Reverse

Tm1=68.98

score1=1.98

length=69 



FIGURE 3, Sheet 58 of 160 



WO 01/38351 PCT/US00/28888 

100/201 



CT1071 

Nucleotide 

Genomic coordinates: 

Start: 253296 

Stop: 255120 (SEQ ID NO: 106) 
Amino Acid 

MRDDTFNQETAVKLVRWYTEYDCCCPLVNRVERLLGSFGGGVDATSVRSRPALYEEDKKG 
DKCIPFRITSLIEGILLERALTKPDLAAAAFDVSEKLVYCSCNNTQGNFDVSSMTIWIDG 
NNSKKYEVTCPSCTVEKISGGAESIHKKPMSLLAFFNNLVEKEAFAERIELKKLYLSLLT 
GSAAGGGGMYKDSSQQSSFNGSWTSLLFHTSKKDKTRLEAEVLVSNKIKHTSRLQPRCVC 
SDLLYALCSTTNNSASYAYKARNLCVIEGGEFLYFKYTIFEENGPFDSKTDLQSLVNNEP 
VSETNSSALAASSSSLEDDDDCCDDDDDDDDDEDEKTKKKQPKKQTKKQKTTTSTLPPIS 
KTNHDNMLMNVLKKGAVNGKRKMMDSLSGKKGQHSKKLKTSAAAGGGASSDVVAGENEEE 
NNPSSVSPTNNRDRKDYVLPCPQIEEVTIFSQHRMNNNKLAESVVKHSVVINGNCLNLFV 
TQHRKKYILPHENILFCPPLVQHVG FNKFRILTGVSCFFDRIEI VFSDQSDSWLSNNAA 
HSAILRLLSYIRENSLKRSVRTASVKGIDFVVKSQDTNIGIPLSNKEIRERQLCSASTLS 
MLAGLGK 

(SEQ ID NO: 107) 
Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

Q20497 (Q20497) F47A4.2 PROTEIN 52 2e-05

KS1_HYDAT (P38978) KS1 PROTEIN PRECURSOR (HEAD-SPECIFIC PRO 50 6e-05

YIL2_YEAST (P40480) HYPOTHETICAL 123.6 KD PROTEIN IN POR2-C 48 2e-04

FKB4_SPOFR (Q26486) 46 KD FK506-BINDING NUCLEAR PROTEIN (PE 47 3e-04

Q06459 (Q06459) NUCLEOLIN 46 9e-04

NPM_XENLA (P07222) NUCLEOPHOSMIN (NPM) (NUCLEOLAR PHOSPHOPR 46 9e-04



Comments:



TaqMan Primer/Probe Sets: 

5'start=1007

5'stop=1032

3'start=1084

3'stop=1105

5'primer=AAACTAAGAAGAAACAACCCAAGAAA (residues 1007 to 1032 of SEQ ID NO: 106)
Tm5=57.02

3'primer=TCAACATGTTGTCGTGATTGGT (residues 1084 to 1105 of SEQ ID NO: 106)
Tm3=57.44

probe1=CACTTCCACCTATCAGCA (residues 1064 to 1081 of SEQ ID NO: 106)

probe1start=1064

probe1Stop=1081

direction1=Reverse

Tm1=68.99

score1=1.99

length=99 
CT1072

Nucleotide

Genomic coordinates:

Start: 255074 

Stop: 257477 (SEQ ID NO: 108) 



Amino Acid 

MLSFNPEYASWFGKMITDPGVILPVSKDVVLFGSRGQSDVGIMTLDPHDLDIKITSKRIG 
VEERLAQYNTLPMDFTRAMEKELNNSRNMKESIFTGIFLDTGSAIFEDNMFNGGGSALRL 
IRSPALNSAVFSSKNYIIKQLPTITKSLRRSQARDKQVDKTREKIVVDSFSILSAIAAQV 
MHLTDGEMTYVPDGHCVNWMSETNASSIYLIINDPTGSGWKIMPNNFNKTLEMRDGVID 
RVETLVEFACKCVASSLIKRGMDLVDMQRTIRSMDFLPPASSTSNNTPRVAIMTSGSSTT 
TGIGSLSILAE DGSTHHQIKLSEYRTGLSITENNREVSFTVEPSIDGVQAEHPLSPSILQ 
WLPPLVKRPEVVAAAAAAVVEEENGDNKPSDKDNEDKYSDTDFWSNVPVTPLITPKKWRA 
CKINDRAMISSWKNNLVKLHKYDWTNKTTKVDYFDKMAAFVALMTFRKFQDILADNYVPP 
QTPSQGSEYAVTMSNVATLFTDVYGFESNGNKPLFALEQLENETGIESI YVLNIIGNSPD 
GNSVRVVRLEKEMSFLLKAKQYFTEMAIPPINEKCKWTDKAPSSVKEYKYFCDLTAPISK 
RPRKDNNDGGVEHSALTYTPRCIYHTERCLVHLYSEPEKITEHVSFNKDLNILEIGKNIT 
NQYQTNYKSIFEIVDVPIIVASMSSTKTMTVNNYIISTPSATTKFVQDPPKTGKQLLAVE 
EVRNFKLKSVLVPPPYFRDNKRNTTLCSQITEQNCPSSSEGGRFSCPSESLILKYSNLSK 
KRALEEIAPETETSILSLAM 
(SEQ ID NO: 109) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

TFC3_YEAST (P34111) TRANSCRIPTION FACTOR TAU 138 KD SUBUNIT 39 0.12 

Q9XIC1 (Q9XIC1) F13F21.2 PROTEIN 38 0.20 

P95629 (P95629) PUTAGENE 37 0.60 

Q9XGN1 (Q9XGN1) TTG1 PROTEIN 35 2.3 

085475 (085475) CELL DIVISION PROTEIN 34 3.0 

Q00741 (Q00741) TAMA 34 4.0 



Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 932 of CT1072: this
corresponds to nucleotides 256520 to 257451 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=1048

5'stop=1068

3'start=1099

3'stop=1120

5'primer=GCAGAGCATCCTCTATCCCCT (residues 1048 to 1068 of SEQ ID NO: 108)
Tm5=58.38

3'primer=CTGCTACCACTTCTGGCCTTTT (residues 1099 to 1120 of SEQ ID NO: 108)
Tm3=58.57

probe1=TTCAGTGGTTACCTCCTC (residues 1076 to 1093 of SEQ ID NO: 108)

probe1start=1076

probe1Stop=1093

direction1=Forward

Tm1=69.00

score1=1.99

length=73 
CT1073
Nucleotide 

Genomic coordinates: 
Start: 257551 

Stop: 259132 (SEQ ID NO: 110) 



Amino Acid 

MGVQKNILVGGGGGVSLLLGVVTLLGTVTEGAPAVPPFSSSSYSFTPESSVFWVEGNRVL 

SGTKKDTLINVLGKKI PYYANSIFRHDCSETRSIQWPETSPLGLNLI FCSCASHEHQHRT 

HETTEPDDLLWDGSRKTTTIILPKKWWSDVVWTSLWRDNDQKCGCGQAFVSSFTSTQKEV 

QGEWLAAHTNGKTSEGDTNSAYLFISLQRTTLKPIITDVTE DNMMMGRMSGTPMNPKDMT 

YFVNDFSDDIGSTPQCLVSNSDILNKREEWIAVWGVADSKDLLTKHQLGEREYGSEGRRR 

NPGVEEEEEERVEEEEEVEVALPYIKKSGKLIGPRRRPLTTTTTTTTTTTTNPIVREWE 

DFDYESFNEPEIFGSNSKLPFIRFLDQKNWRLGIMSRVSSSIANFKIEQESSKALFCLAV 

WVGDEHTPKFRLSVWKNWKPFTSAPIIVQNVGYSSDVFWHETLRSKIVDRSRDLIETKVT 

KKIGEDWANKKQTVVAMFISGIVCITVTVISIFSIVI YYKIKMPKF 

(SEQ ID NO: 111) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



G1184543 (G1184543) NAD+:PROTEIN (ADP-RIBOSYL)-TRANSFERASE, 44 0.003

017412 (017412) CHITINASE 41 0.015

AAF00095 (AAF00095) HISTONE ACETYLTRANSFERASE MORF 39 0.10

AAF00100 (AAF00100) HISTONE ACETYLTRANSFERASE MORF BETA 39 0.10

015087 (015087) KIAA0383 (FRAGMENT) 39 0.10 

AAF00099 (AAF00099) HISTONE ACETYLTRANSFERASE MORF ALPHA 39 0.10 



Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 511 to 925 of CT1073: this
corresponds to nucleotides 258607 to 259021 of the genomic reference sequence.

TaqMan Primer/Probe Sets: 

5'start=698

5'stop=721

3'start=823

3'stop=846

5'primer=CCATGAACCCTAAGGATATGACAT (residues 698 to 721 of SEQ ID NO: 110)
Tm5=57.04

3'primer=GAGGTCTTTAGAGTCTGCAACACC (residues 823 to 846 of SEQ ID NO: 110)
Tm3=57.29

probe1=TTCGGACATCCTGAACAA (residues 780 to 797 of SEQ ID NO: 110)

probe1start=780

probe1Stop=797

direction1=Forward

Tm1=69.00

score1=1.99

length=149
CT1074

Nucleotide 

Genomic coordinates: 

Start: 274526 

Stop: 275153 (SEQ ID NO: 112) 



Amino Acid 

MYIFVEGSPLTGKSSWMSKLIDTGSCGMSFLNFLRMNTSDYYNWPAEIGTEHLQLGFRET 

RWDGMFEPVLKTFVDSWKKEQGKESLKEYLDYNGQVMEIYIAEWLRQRPLAFHVFTYTD 

EAVKSGFLNEEDLDMDTATKWMAEIIREKRGNIQEIKVTPRVVFNGNVCSACFSNTKRNL 

YNFGTNYNNVVHCDLLCPFARHRIVHFL 

(SEQ ID NO: 113) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9Y7X6 (Q9Y7X6) HYPOTHETICAL 75.7 KD PROTEIN 38 0.079 

Q9YBC0 (Q9YBC0) 431AA LONG HYPOTHETICAL PYRUVATE DEHYDROGEN 35 0.52 

067329 (067329) DIHYDROOROTATE DEHYDROGENASE ELECTRON TRANS 33 1.5 

DP01_BORBU (051498) DNA POLYMERASE I (EC 2.7.7.7) (POL I) 32 4.5

YHC3_YEAST (P38742) HYPOTHETICAL 130.0 KD PROTEIN IN SNF6-S 32 4.5

051342 (051342) ATP-DEPENDENT CLP PROTEASE, SUBUNIT A (CLPA 31 5.9



Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 563 of CT1074: this
corresponds to nucleotides 274555 to 275117 of the genomic reference sequence. 
CT1075

Nucleotide 

Genomic coordinates: 

Start: 277704 

Stop: 278079 (SEQ ID NO: 114) 



Amino Acid 

MWRSCISNIREMGDNKDYETRLIQRINDLESEIENKTELCEKINEQMKNTQLKYDKCFVE 
EETEKFRKMEERVKYLKEQGI PLDPEERRTMLAEIDKSNKELDALLEENERI IKLIDEEL 
ESMK 

(SEQ ID NO: 115) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

044929 (044929) MICROTUBULE BINDING PROTEIN D-CLIP-190 47 5e-05 

Q13439 (Q13439) TRANS-GOLGI P230 (256 KD GOLGIN) (GOLGIN-24 43 7e-04 

Q59037 (Q59037) HYPOTHETICAL PROTEIN MJ1643 43 0.001 

Q25662 (Q25662) REPEAT ORGANELLAR PROTEIN 43 0.001 

KINH_DROME (P17210) KINESIN HEAVY CHAIN 42 0.002 

AAD29948 (AAD29948) MYOSIN HEAVY CHAIN 41 0.002 



Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 388 to 1 of CT1075: this
corresponds to nucleotides 277689 to 278076 of the genomic reference sequence. 
CT1076
Nucleotide

Genomic coordinates:
Start: 278221 

Stop: 278473 (SEQ ID NO: 116) 



Amino Acid 

MQKKYDKLFEDDKRFREIEERILQQKEKGNPLDPEERLVLSADIDRSMKEIDDCLEEINH 
IELSI DTLLDECENLHYGLQTTK 
(SEQ ID NO: 117) 
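
For this record the distance between the genomic Start and Stop (278473 - 278221 = 252) matches three nucleotides per listed residue plus one stop codon (3 x (83 + 1) = 252). The Python sketch below illustrates that cross-check. The name `expected_cds_length` is hypothetical, and the reading that the coordinates bound an uninterrupted coding region that includes the stop codon is an assumption inferred from the numbers, not something the listing states. The stray OCR space in the second sequence line has been removed.

```python
# Illustrative sketch only: expected_cds_length is a hypothetical helper.
# Assumes an uninterrupted coding region whose span includes the stop codon.
def expected_cds_length(protein):
    """Three nucleotides per residue, plus three for the stop codon."""
    return 3 * (len(protein) + 1)


# Amino acid sequence listed for CT1076 (SEQ ID NO: 117), spaces removed.
CT1076_PROTEIN = (
    "MQKKYDKLFEDDKRFREIEERILQQKEKGNPLDPEERLVLSADIDRSMKEIDDCLEEINH"
    "IELSIDTLLDECENLHYGLQTTK"
)
start, stop = 278221, 278473
genomic_span = abs(stop - start)

print(len(CT1076_PROTEIN), genomic_span, expected_cds_length(CT1076_PROTEIN))
```

A mismatch for another record would not by itself indicate an error; it could equally reflect introns, untranslated sequence, or OCR damage in the listed protein.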

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

018164 (018164) Y6B3B.1 PROTEIN 35 0.076 

P70388 (P70388) RAD50 HOMOLOG (S. CEREVISIAE) 35 0.10

Q06268 (Q06268) INTERMEDIATE FILAMENT PROTEIN 33 0.30

USO1_YEAST (P25386) INTRACELLULAR PROTEIN TRANSPORT PROTEIN 33 0.39

Q07380 (Q07380) HYPOTHETICAL 206.5 KD PROTEIN YDL058W 33 0.39 

096275 (096275) RESA-H3 ANTIGEN 32 0.51 



Comments:

Hit to public SBV sequence:

gi|6007410|gb|AF178573.1: CT nucleotides 13 to 252 match nucleotides 2050 to 2289 of the
public sequence with a 100% homology, a score of 476 and an Evalue of 1e-137

TaqMan Primer/Probe Sets: 

5'start=88

5'stop=111

3'start=147

3'stop=170

5'primer=AACCCTCTAGACCCAGAAGAAAGA (residues 88 to 111 of SEQ ID NO: 116)
Tm5=57.69

3'primer=TCCTCGAGACAATCATCAATCTCT (residues 147 to 170 of SEQ ID NO: 116)
Tm3=57.97

probe1=TATTGTCGGCTGATATTG (residues 116 to 133 of SEQ ID NO: 116)

probe1start=116

probe1Stop=133

direction1=Forward

Tm1=68.98

score1=1.88

length=83 
CT1077

Nucleotide

Genomic coordinates:

Start: 278723

Stop: 280976 (SEQ ID NO: 118)
Amino Acid 

MEKKTETAATTEKDPEPSVSKRSRNKEPKTTSTVYTSVKCYLSSIIKSESSRSNVTSTKE 
RFEERCKSVSKMMVKGSLFLRLVVDECLRRYNHLEDEIDKWPDMTKDN FYVQLLRKGLDK 
KKLKEGSTHPVVEDVWNSPIVQETFLSQQGEGNNPIKRHLMDFNTITYAAKQLKTCFETN 
LRTH FRTRQQRAI SGWLAENGFDKKYTKLVQHWI IGCT YKSDW VDSGDLER VKEGTKN FV 
TLHRKHLCVISDKKNGTISYSPEEKYPIPSILNYYKFLQTEY PQNKKIQKMIVVPKHKLK 
IHYCTFDQTTIQGICKDLGVWKDMEERHKQSEDILYKQGWYLLFDVKKIKKLRPNWNFHS 
IQTDGEGVSVLFSREVEEVETVSKKSKKNKKPRGDEDRRNYPPTNAKYVVGVDPGRTNVV 
SCSVFDTRQKRVVRKHRMTAKQYYQESWMTDRRKANETYKKNNKEYKEALEEITRYDNGE 
E I IN DGNGDTSTPTKKFEAYLKVVNEHYRLLWNEKGKKK YRKNAMKV YSRKQKC ISNFID 
ELI PKRDKIEDYHIAFGDAKFACTGRGEQYAS PARI FAKKTKERVGGDKRFTFVDEKYTS 
KVCHRCNQPLNMLEKDCFSPNKKRKPPTIVTTTTTTTTEEDEENGKWKKATPLRENRDTR 
RCSSEKTQFGYSSNRKVSTGDISMETPVPSSTSSSFCTPTSITCVLGGKFVDRDFNASTN 
I VHKFLGFWDKKLMEKKDKMPLKYHFI RVA 
(SEQ ID NO: 119) 

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

Q9ZAY5 (Q9ZAY5) SURFACE PROTEIN C 46 7e-04

CYL1_HUMAN (P35663) CYLICIN I (MULTIPLE-BAND POLYPEPTIDE I) 41 0.022

062231 (062231) F35E2.9 PROTEIN 39 0.086

Q12080 (Q12080) P2610 39 0.11

MYS2_DICDI (P08799) MYOSIN II HEAVY CHAIN, NON MUSCLE 39 0.15

069188 (069188) C3-BINDING PROTEIN 38 0.33

Comments:

Hit to public SBV sequence:

gi|6007410|gb|AF178573.1: CT nucleotides 12 to 297 match nucleotides 2
public sequence with a 99% homology, a score of 551 and an Evalue
CT1078
Nucleotide 

Genomic coordinates: 
Start: 282175 

Stop: 282586 (SEQ ID NO: 120) 
Amino Acid 

MGNSESRSSGIEIVHKNGAPKRSHKTLYLSNRTERHAQIQKQIEELHHKTNKQFEQAQKV 
LDKNEERKKHQQQQQI I IPLDPEERRAILAEI DKHMKEIDGFIEESERLGLLV DAEINNL 
EEKEVEEEHLLKQKED 
(SEQ ID NO: 121) 

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

ARP4_STRPY (P13050) IGA RECEPTOR PRECURSOR 45 2e-04

AAF05247 (AAF05247) ORF133 43 6e-04

MYSG_CHICK (P10587) MYOSIN HEAVY CHAIN, GIZZARD SMOOTH MUSC 43 8e-04

AAD46501 (AAD46501) LATENT NUCLEAR ANTIGEN 43 0.001

040947 (040947) ORF 73 43 0.001

Q9ZGM5 (Q9ZGM5) M-LIKE PROTEIN (FRAGMENT) 42 0.001

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 5 to 344 of CT1078: this
corresponds to nucleotides 282190 to 282529 of the genomic reference sequence.
CT1079

Nucleotide

Genomic coordinates:

Start: 286863

Stop: 289635 (SEQ ID NO: 122)
Amino Acid 

MSSSSSSSFS FRISTYQTFLKALAHPDLVDKITQKCDETGRNQKCPIQFLADISHLIQGE 

RNGGNLFPLHPFKNQPHLEPRIVGSLHGRTLDNDIEESYCYFVKDLYNGVFSYVNGVKEL 

QGVLDKKISGSGSGESSSSRAPLIPITDVDLLYIFGTLVVLPPRSKAYRVITEAVLALPF 

NEFSNNWP PTN I KGAY VSR DFRM FNLLAGL DH I EGE VGGESEWES I HAS WKRMVTIMRN 

KAEKKPPSTSRIFRVYVAE PVNDAVTKI PIRVLSKLFGSRLAGILQKVYSYSMLNLPYLL 

SSNSIDIKQGVKGITLSIPSARKLGFYLLQKDTTLQSSLSQDVADCIVSINAGIIGDDFS 

EKIRQCIEEKNKPENCCMCFCEI DKTPDFSYSEHVARHNFFPVHAFSSSHDDKCCGAKIC 

SECIFPYI ISLYEKMTGVAGVKVVDLFQCPGCKSGMLNLKGRCYEFSNLCKRMILPYTST 

HCSSLFDATINRAEACFYSLEFLQYDFETARRIAHGAKDIPHVYNKVVKNVKDLDRLCAL 

YCYKCVSPVVCDEPNESTDYEMVDVTPPLINLTEIVDSEEYDDGPGNHMWPAKFTCNFIA 

GSSGETPTISTCRDAVTFLGRAPRKKMAGWDDQSAVGQAIIALANWRKSGELPKNMFDLL 

EGVNAVLYRGDSFLLRAINYPCVIGRSMSPSLELVKRKVNKIALIKAFFHEKRVRPDASK 

KLLEWAELLVKSYLMEVLLQTPECVIHRAHSFVGKTLLITDELVHMRPDDATRNAYIQNL 

NAARQNAAAAASFSGSLPKPEFVPCKERTIEWMYEKDNDDVRVVNCPSCKKAIQKYGGCV 

NVFCECGTNMCWICEEKVSPADSNHCVEKHRIVYSNCVRVKYALESMYGFEICTMKNVEE 

GVKNYYVMENGFFFDVQEMVAKK 

(SEQ ID NO: 123) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

AAF04637 (AAF04637) HYPOTHETICAL 84.4 KD PROTEIN 213 3e-54

097226 (097226) PFC0175W PROTEIN 49 1e-04

Q94981 (Q94981) ARI PROTEIN 48 4e-04

CAB45785 (CAB45785) HYPOTHETICAL 262.6 KD PROTEIN 46 0.001

CAB36714 (CAB36714) HYPOTHETICAL 68.5 KD PROTEIN 45 0.002

Q9XII0 (Q9XII0) F7H1.11 PROTEIN 45 0.003

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 910 of CT1079: this
corresponds to nucleotides 288641 to 289550 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=1148

5'stop=1167

3'start=1192

3'stop=1211

5'primer=TTGACAAGACGCCCGATTTT (residues 1148 to 1167 of SEQ ID NO: 122)
Tm5=59.14

3'primer=TGGACGGGGAAGAAATTGTG (residues 1192 to 1211 of SEQ ID NO: 122)
Tm3=59.35

probe1=AGTGAACATGTGGCAAGG (residues 1174 to 1191 of SEQ ID NO: 122)

probe1start=1174

probe1Stop=1191

direction1=Forward

Tm1=69.00

score1=1.99

length=64 
CT510

Nucleotide

Genomic coordinates:

Start: 38917 

Stop: 37381 (SEQ ID NO: 124) 



Amino Acid 

MAETVAVDEVPTCPICMGDYDSDTDCYNWSNGGMPCCRKSVHLECLFTWRFEEHMVNENH 

LLCPMCRAYI PPVWFFRKVYEEVYKYASFHSFLLSADYVNDEGVKDTLNKMSTILAPTFF 

V PNAKGVNENE DV YMERAYTKLS FMLET LS RQEMHAFS EETFE DNHE AALMGKFKDI P P Y 

EYEGEWLKYVAPNTIDITQCLSNDDDDDEGDNNVSPSLLSGVTSFNFIEDDEDTVVFVPP 

EVDDNDDSESLPDLTVPPRSNNITFDTISGISSSLYDVNDDDDDDDTMSLPDLNMPSAST 

SSAPTSSAPTSTSLNINVNLCFNVDSDSDDEEVIPSSSSVNQPSTSSGSSSSSSNSRKRP 

RYGRDEDRMSNISSESKRLCVDVKRYMCRLDNIDEEYNEIANRYLAELSALRERRQETEN 

KLGDCISRGNLFHTTVNDVIGKSLCSKKLKVKRKYASKWSANKQLIGSCLIKSASNNARL 

DDEIAHVHSSLLNGFDTDPSEAE>QISSLPNL 

(SEQ ID NO: 125) 

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

070567 (070567) DENTIN SIALOPHOSPHOPROTEIN PRECURSOR 56 7e-07

P97399 (P97399) DENTIN SIALOPHOSPHOPROTEIN PRECURSOR 56 7e-07

SR40_YEAST (P32583) SUPPRESSOR PROTEIN SRP40 53 4e-06

095815 (095815) DENTIN PHOSPHORYN (FRAGMENT) 53 6e-06

P87736 (P87736) RING-FINGER PROTEIN (FRAGMENT) 51 2e-05

Q53653 (Q53653) CLUMPING FACTOR 51 2e-05

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 444 of CT510: this
corresponds to nucleotides 37411 to 37854 of the genomic reference sequence.
CT511

Nucleotide 

Genomic coordinates: 

Start: 40718 

Stop: 38972 (SEQ ID NO: 126) 



Amino Acid 

MGGPTVITTTINTGGDHHHQQYVYHQGNKKRPVEEYNNNNYASGSTSEATTVPAYNNNNN 
NITIKTWDDVINLSITPPPPKRFKKSEVAPSPPTTRTFSNVCASKVIRQCKRQYNEWIER 
DSPYYFKGIEKSCSLEDNYDTCQQLRIGHRSIVKSSKYVHDTCFYGKDPKVGFYWPTSSC 
DEEMRFFDTRHILKELSSRNIPSSQIMDIMYMAVEVFQLPSSACERIRQKTSTLIKEVSD 
QCENWENFRKTARGCLSDLVEVPEDVKDFNTFICPWETFFEIKYGVYYIVNRGTVVKFMK 
DMN YEE FVFECVNGLS VYRKN I KGVVGVTGVC PQGLCLEM P FAG I S I DDV I RCVKDSLDG ' 
GEYYESRDARLLYGVVMLQRMGRLPEVKGVDTVAPITDSFIARKVVRSMFEKLKVNMPFV 
LAETCNVITRVANEGIINVDIKADNFVIDSISGQPKMIDLGLSYPLGYCYNDEYFRNTEE 
LIRQYIHTPPEFFRGHCLGAYSMTYSFSVMASSILEDVVACSNMEGPAFNLMSNMHFLML 
LQSGTDTDFYQNRPSITEYALAMKHIFPFKGTVMNLFKVKK 
(SEQ ID NO: 127)

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q15208 (Q15208) NDR PROTEIN KINASE 41 0.017 

Q56921 (Q56921) PROTEIN KINASE A 41 0.017 

085239 (085239) PROTEIN KINASE YOPO 41 0.017 

068717 (068717) PROTEIN KINASE A HOMOLOG 41 0.022 

CAB54949 (CAB54949) PUTATIVE TARGETED EFFECTOR PROTEIN KINA 41 0.022

YPKA_YERPS (Q05608) PROTEIN KINASE YPKA PRECURSOR (EC 2.7.1 41 0.022 

Comments:

TaqMan Primer/Probe Sets:

5'start=878

5'stop=902

3'start=981

3'stop=1001

5'primer=GGACTGTTGTCAAGTTTATGAAGGA (residues 878 to 902 of SEQ ID NO: 126)
Tm5=57.47

3'primer=TGAGGACACACACCAGTCACC (residues 981 to 1001 of SEQ ID NO: 126)
Tm3=57.98

probe1=TGAGTGTGTTAATGGCCT (residues 927 to 944 of SEQ ID NO: 126)

probe1start=927

probe1Stop=944

direction1=Reverse

Tm1=68.98

score1=1.98

length=124 
CT512

Nucleotide 

Genomic coordinates: 

Start: 61872 

Stop: 60672 (SEQ ID NO: 128) 
Amino Acid 

MLSTCDLKHPSSTDGNVLKNIHFSESIPANDIISFPSSDTEELNKDLLDSVRNQIKFGFD 

PITETLKNCITTQTLLHSFLKSSLLTLQEKFNEWGSIQLEKGGQEMALCASLKIMGQISA 

LIETAKEASMDNKKKNNNACANCRDSKCSASLVTLFNKTIDEKYVKQNSSSASALLANTF 

TAGANKPPKEFITKDNAHGNSDTNYTAMSDNLICPGKYYSSDITYEVTKQAKERIKNNNK 

KMRLATGVEMVMKELEAENNKEGGRVEVEVEGVEQQQPSTSGEEMQMEIMLPTPPPPDLE 

SLVTEGVDDYPVFSPLPSLLSPMPASPLPSNGNSALEDGGPFAPSADIVVDKTSEIMGRT 

PGSEWVHQRDRNSKMEIRNYGARGSGINTGRYRRNNTVL 

(SEQ ID NO: 129) 



Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

GARP_PLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN PRECURSOR 38 0.12

YMEI_YEAST (Q03433) HYPOTHETICAL 32.0 KD PROTEIN IN CAT2-AM 36 0.36

Q9WTU0 (Q9WTU0) PHD-FINGER PROTEIN 36 0.36

015029 (015029) KIAA0312 (UPSTREAM REGULATORY ELEMENT BINDI 36 0.36

MML3_MYCLE (006081) PUTATIVE MEMBRANE PROTEIN MMPL3 36 0.47

Q06166 (Q06166) MATURE PARASITE-INFECTED ERYTHROCYTE SURFAC 36 0.61



Comments:



TaqMan Primer/Probe Sets: 

5'start=549

5'stop=570

3'start=626

3'stop=649

5'primer=TGCAAATAAACCACCCAAAGAG (residues 549 to 570 of SEQ ID NO: 128)
Tm5=57.79

3'primer=TGCCTGGACAAATAAGGTTATCAC (residues 626 to 649 of SEQ ID NO: 128)
Tm3=58.13

probe1=GCACATGGCAATTCTGAT (residues 589 to 606 of SEQ ID NO: 128)

probe1start=589

probe1Stop=606

direction1=Reverse

Tm1=68.99

score1=1.99

length=101 
CT513

Nucleotide

Genomic coordinates:

Start: 77506

Stop: 76273 (SEQ ID NO: 130)
Amino Acid 

MEEHLSFNKPSPENGVVFFDFSDNTSMSNMVDNIRHRLPMDKKFSSKALLLASTPIPSDE 
QLSTKVNKAI FSHRETIVLSKALKIVVTGLYVDGEYVDDVICLYPEKHTLNGILRYVVHL 
NMMLMDKAEDADEIRCGLIPLGRGFNREAFKFVDPVIPCAGYNILNGYHPDNGHQISPSS 
TQPQVQRRCAVKQMYKQINGMFEVVKQFSIKHNNRIFTINQVDFKGEEMKMFFALYSEEL 
LPFYSETGKLLSEKHVSKSFSQLPPHVTISVFYLRNMEEYNTLMKTDFGSCFAPAIKIDT 
GDNFELFGMNNNILVSKVCVGDDALDLRRRIMEHISDAIGRNVELADNRLNPHITHGKIN 
EGWGEWVSRFAPCNFLCKPREEIVFGGTKFIFGRVSNGNYVIKQPVDYV 
(SEQ ID NO: 131) 

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

032743 (032743) RECOMBINASE 37 0.23

SYA_RHILV (P24075) ALANYL-TRNA SYNTHETASE (EC 6.1.1.7) (ALA 34 1.5

003942 (003942) LACTOBACILLUS BACTERIOPHAGE PHIG1E COMPLETE 34 2.6

Q9X257 (Q9X257) CONSERVED HYPOTHETICAL PROTEIN 32 5.8

022993 (022993) CELL DIVISION PROTEIN ISOLOG 32 9.9

YK05_MYCTU (Q10851) HYPOTHETICAL 30.9 KD PROTEIN RV2005C 32 9.9

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 982 of CT513: this
corresponds to nucleotides 76314 to 77295 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=502

5'stop=522

3'start=559

3'stop=578

5'primer=TACCACCCAGATAATGGCCAC (residues 502 to 522 of SEQ ID NO: 130)
Tm5=58.25

3'primer=TGCTTGACTGCGCATCTTCT (residues 559 to 578 of SEQ ID NO: 130)
Tm3=58.35

probe1=ATCTACTCAACCACAGGT (residues 537 to 554 of SEQ ID NO: 130)

probe1start=537

probe1Stop=554

direction1=Reverse

Tm1=69.03

score1=1.96

length=77
CT514

Nucleotide 

Genomic coordinates: 

Start: 102885 

Stop: 100046 (SEQ ID NO: 132) 
Amino Acid 

MDSTSTTTIEAEKALLKEYVNENLTWEFVDRVIRHEKLMQRTDMRLKTSSRRLFSFISIY 

SFLQDFFTARDGVNSDEWCTQSALYHMLDGVASIISCFRKRIDYYNKKMERLACTSIREG 

YFLVDVKTIESRHVELLDPDKKIWQRLYAEKIAPEKVVDAYNEVSKLLPDEAMANYNYRT 

GLVHLSDTLKNAKKPPTDLTMTDFDFYEKYIRSDIVLGKSNKLSGMFSENFEILPDINIK 

VPRRLERYFNVETNYSLEHNFRFPSNHIRGLIFAYFIGNIFGGAFSCVQLYLLGFTLSAA 

SACRENVLDTPFSKLKQYIKNDNKTKNSSSNEDNDGEEYYPCELQYARINSNDKNACRKS 

IVKAVKFVADRVEKASVTMMRTPIAEHESDGYMADWLSLQISKLLGRKVSASYALLFIVN 

WVAHKYKQSFTNDVNGSEKYEILLKKLTVACGLTYNHKCGMVVPVIGFGSGMTNRKLRQY 

AVHC I ENVIGS FI SSGKRKKDI HEDPKKLEEMSLMQLS ARLFKNNDVMKRGQDGKVTFAN 

EDNVQDFLEELKTKEFVPNERRRKIHEEEYTKSLHTNLKMTFRFGVCGFQHPLPASSDKP 

TQVSLQLLKQRQTFVQRETAAAVNWTRLLQFLFPSDERDNKRHQNSLPWNRLGSNLNRHF 

ISLASKFIKRSVHCERWNDIISKFNADILPLGKDPDHFLMTKAGLVIEDHARENIDNAM 

YSLCGGFNNQTTEQKLNSIRLRISAEALKNARNCVLATTFSKSYNEDRPFLPRTDEAKFV 

PIPLFGVEPLHPLLNSFIDNTANKCNDSVSDFWLEESDDIFKEALVSHTILTDSSVYSTL 

VGEDEDYCDNNKSGKRIGNTLVCTLYDMMGRANYNGLHSDKPRKHDPTPWSSKNTGQSGR 

STTDFSPNSVIVLLDTENVADDYEDEEEDYEALKQSERDNVITLNNX 

(SEQ ID NO: 133) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q12537 (Q12537) GLUCOAMYLASE PRECURSOR (EC 3.2.1.3) (GLUCAN 39 0.11 

AMYG_ASPAK (P23176) GLUCOAMYLASE I PRECURSOR (EC 3.2.1.3) ( 38 0.32

AMYG_ASPSH (P22832) GLUCOAMYLASE PRECURSOR (EC 3.2.1.3) (GL 38 0.42

MSH6_YEAST (Q03834) MUTS PROTEIN HOMOLOG 6 36 1.2

YMM1_CAEEL (P34489) HYPOTHETICAL 81.8 KD PROTEIN K01B6.1 IN 36 1.6

AMYG_ASPNG (P04064) GLUCOAMYLASE G1 AND G2 PRECURSOR (EC 3. 35 2.1

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 3 to 841 of CT514: this
corresponds to nucleotides 99939 to 100777 of the genomic reference sequence. 

TaqMan Proe Sets: 
5'start=1126 
5 , stop=114 7 
3'start=1173 
3 , stop=1194 

5 " primer =TCCGTCACAATGATGAGGACAC (residues 1126 to 1147 of SEQ ID NO: 132) 
Tm5=59.41 

3»primer=GGAGAGCCAGTCTGCCATGTAG (residues 1173 to 1194 of SEQ ID NO: 132) 
Tm3=59.86 

probe 1-CGAACACGAATCAGATGG (residues 1155 to 1172 of SEQ ID NO: 132) 

probelstart=1155 

probe lStop=l 172 

directionl=Forward 

Tml-69.00 

scorel=l . 99 

length=69 
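
The length value reported for a TaqMan set appears to equal the inclusive span from the 5' primer start to the 3' primer stop (for CT514, 1194 - 1126 + 1 = 69). The following is a minimal Python sketch of that arithmetic, assuming 1-based inclusive residue coordinates; the function name `amplicon_length` is hypothetical and is not part of the listing.

```python
def amplicon_length(five_prime_start: int, three_prime_stop: int) -> int:
    """Return the inclusive amplicon length for 1-based primer coordinates.

    Assumption: the amplicon runs from the first residue of the 5' primer
    to the last residue of the 3' primer, both ends included.
    """
    if three_prime_stop < five_prime_start:
        raise ValueError("3' stop must not precede 5' start")
    return three_prime_stop - five_prime_start + 1


# CT514 primer coordinates as given in the entry above.
print(amplicon_length(1126, 1194))
```

Under the same assumption, the other entries can be checked by passing their own 5'start and 3'stop values.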
CT515

Nucleotide 

Genomic coordinates: 

Start: 114953 

Stop: 110132 (SEQ ID NO: 134) 
Amino Acid 

MSRNSLRVKGLKENGGIIPNPFDPLYVDTDAPFGMAGVKSDIIGKGFVESLLPGEISSHY

NTFDCFKTPKKCRVGGNDFECISCRSLGGGTCVKSSRELKTEYGIEDDDEYDGVCVPLAD

TIFSASSAFDKHDDDVATDAAYRNVNPFTTVEEAYLHYESGGVITGGGKKGTTYITKKRG

CVDSSVVRKDPSLLNKDPRLEPILGCTDIVLCGGKGVGRPIHPTTFSIIDDVDDIDFDIS

SMTSTMDCLCEPGYSQQRDPATNAPKCEKKEGGIQEKEQGLGCPVMFRYGVVGDTGTKGC 

LCDESTQIRLEEVAGIDLPDAAKTDYAQPFVEGAKLLLQITERYETLGGSTKDACLPRPG 

NDTRMSALGYSYAASLFGRAPEITAFNGGHLITGGLLRESAMDAAGNWHSRIEDSDEQGK 

LTVSESVGGVVPYSGTGSVAAHIWNGDALNDNGLVGAGGGNFTEHPNASLRWPLPHSNI 

PGLGIDSIDHAVGIIASQGKIFPETVHMRAGDPSGVKTDRRDAHNDTTIETSFLKDSDKA

GYDSYKDNPLQKLRKSHDSGICATAYVVPSLHRVIKEKPSAKNDKTVNKILPLVHYRPTA 

KRMAHTPIETIFKHSLLTAQERDQSFANSTLNSMMVTNSSNSFDDVTNLLLDYFFPNLNG 

EGKERSGLPINTRSIYNEPNNAKFKEIGGIILQPVTAQGAKKSSTFARFSEKILSTNSPK 

IIDHYKAGSSAVFKKVGEKEAYEMFAHPPTAWRIASNEGTFFSGRGLNNGIEGTGMREAE 

RVAKTLSKKPDIFAGAILTGDGVLMNGASSPLVRPMEIPASSLPEHTWFERRSPVNARGD 

PGSADNLTAINNTYDRVTKGDIRAILNSTTDIKTSFNSYAPARPFSKPLAPPAGVSAAAQ 

ATSFLGVLGGFPLPIPSSFLIQKSVQESVSNGTVGSMHGIVPLKFHEGDELWQQCEVKET 

EGALNFIPPPMALFESLLRVRTLSSETFIRPELIPNRFRADWGLSPHTAGHYLNGVYSPP 

CVREETGQSFGYPCSGALSQYTTMMVPKPLGPQSHSSLSKFSIKSYVEEQTRLLPANIGE 

KSIFEMQDPNSKNIFDKIGELGEKENCNCTNGLFCPKVNGGGRNKTDPIAATPSRGNRHS 

RFPLMTTLPKNDVHLSAALLRAQSGDARILNTIGETKTNGRKINLKAATENIWDISSNVM 

LAPNKFCAMRRSTAYTPYSTRQEKVPAAVLDERKGTFDRNAELLGDVGMTDIVSNDILME 

DYERLPGVPPAEAEIFHIIRDAAKTGQEGAKARRIVDFFESSHGVTASTFNVGTFSPYVE 

GVKDIVSLYATPCFTDIDSPTISADSATINEGASIEPTDGSEVVVEVVNSNMEMLGGSTA 

GSTKKRRLSISDYVDLEEDAEVFTINKQGKATENLRVRTSSSSKYVEGGQKDMVGFYEAS 

KRVPRVMRRVHVLPVLTPYHGGFESCAPTAAQSACTRGVEITYADFMRPSDLSGTKTTLE 

GVRVKGPEPFDDLSTLYFRSVGGPNLRKFAHHHHFGYEGLMSRYYYTREKTVSVSEGDLK 

DRFPFVCQSDRGPFPPKRDGTIQPLALVDMGVLPEGALTRRTISME 

(SEQ ID NO: 135) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q24341 (Q24341) LARGE FORKED PROTEIN 37 0.96 

Q24340 (Q24340) SMALL FORKED PROTEIN 37 0.96 

O13788 (O13788) HYPOTHETICAL 59.6 KD PROTEIN C17G6.10 IN CH 37 0.96

CHOD_STRSQ (P12676) CHOLESTEROL OXIDASE PRECURSOR (EC 1.1.3 37 1.3

HAGA_PORGI (Q51845) HEMAGGLUTININ A PRECURSOR 36 1.6

P94986 (P94986) HYPOTHETICAL 88.5 KD PROTEIN 35 3.7 



Comments:
CT516

Nucleotide 

Genomic coordinates: 

Start: 118885 

Stop: 115402 (SEQ ID NO: 136) 



Amino Acid 

MKIVQNNFTPDERSGVIHIRKPAKIEKAVFGNIAAAIDDSAAVRKDPKKKRNLKNGLEPA 

SKKLAKNIERISSEELKRVTDVQDPKLLHSIMKRTARQIGYDIGDDISPQSAPDRDGSSS 

SSLLPIRMINIRTEELLEKGGKDTIVRIHILDGILPDNVPLPFKAEIKVDLVDEKYEGED 

GGGSSDSGPSLFETFPEFVPAGWPPITNDPNAFSRNNGNKQQAVFKHVEVNSLADGITLS 

TKGSIFNTGNRLKISIVTEDKNKTVLFDSQVTISSPIPKITEVFACRNVALMRLDMPKAI 

NYDNVEYTPDTLNEKYVSDYPANFPRLSRQAEIASNLAAKLPRENQLSDINKPSVSFVYS 

KTNTVNTPVLNKVLYNETLKNMEGNESEGYKILNATEITHLRNPSNPARTFICVSVPESE 

IEAQWKMLGWIVGFKTSSDVLTTSSGYNIVFPASKVTQSDKLFSVISTDVNANTNKVVVH 

NTPSRVGCFGSSVNFRVDAATAPDWPGPTNGPDFFSYQLRPCIILKTDNDNREPRITAVL

SSPATEYAGERTTSLLPRALNVSVGPLTEVRGGDIITPVQTALLGGEQPTFKAPAEPTKL 

YAVFPVLDSHNGLVKASDNPFQPIHSITSRNKTTVLTVSDVIVNDDDDDVVLEDKSYHIT

VSDPVSGSILAKENVLSSRITSRPIFIDGARDDRVFSVKMEVFGGDDKGIQMPFTMDGHF 

EGQFSDMSVPSNELAIWNDPSTFTAPVRDTPATDITNKGIVYCRTTLPPISNRGIRDPFM 

KQTSLVPLPTSIPEWAFADYGGEIKYPRHIFISSIRTNDTTNIVNTDTQTEFSIENWLRE

QIDKEQERHRQLLPAPSEAYTQGEKVYAKMYMGDGVSEETLDQIVHTSNTTYVVDESGTK 

KENLLVNKEDKKLAAILGKWGIVVFGANKYPDEPADRYTNWRNTGRLRAVGSYSQLRQPV 

APLQTRLATWPSGDPVTRLADGQFLVRLDPRCGGIGSANGFYNNNGANNEFTSSLLFAIV 

GNQDKVVSYAERVRFYMKIVARNEGKKHLKNDDGLVLVDRNSALHRRLWNRTTFDHDDIV 

LCVKIPQNVMSKIEPGTSSGVLVDPLVFANVASSTDREEFYKKFIDTSSGPVVIDRASVT

SSYNISVPLNFYTTCGFIVG

(SEQ ID NO: 137)

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

DNA2_YEAST (P38859) DNA REPLICATION HELICASE DNA2 37 0.68

O94534 (O94534) PUTATIVE SPINDLE POLE BODY-ASSOCIATING PROT 37 0.89

BAA84527 (BAA84527) ALP14 37 0.89

YGS4_YEAST (P46947) HYPOTHETICAL 30.5 KD PROTEIN IN SAE2-KE 36 1.5

O28907 (O28907) GTP-BINDING PROTEIN 36 2.0

Q49547 (Q49547) LMP3 PROTEIN 35 2.6 



Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 888 of CT516: this
corresponds to nucleotides 115494 to 116381 of the genomic reference sequence.

TaqMan Primer/Probe Sets:
5'start=1869
5'stop=1892
3'start=1984
3'stop=2001

5'primer=CATTCACAGTATCACGTCAAGGAA (residues 1869 to 1892 of SEQ ID NO: 136)
Tm5=57.80

3'primer=ACCAGACACGGGATCGGA (residues 1984 to 2001 of SEQ ID NO: 136)
Tm3=58.39

probe1=AGCTACCACATCACAGTC (residues 1966 to 1983 of SEQ ID NO: 136)

probe1start=1966

probe1Stop=1983

direction1=Forward

Tm1=69.00

score1=1.99

length=133
CT517

Nucleotide 
Genomic coordinates: 
Start: 180036 

Stop: 179421 (SEQ ID NO: 138)
Amino Acid 

MEFGNLTNLDVAIIAILSIAIIALIVIMVIMIVFNTRVGRSVVANYDQMMRVPIQRRAKV
MSIRGERSYNTPLGKVAMKNGLSDKDMKDVSADLVISTVTAPRTDPAGTGAENSNMTLKI
LNNTGVDLLINDITVRPTVIAGNIKGNTMSNTYFSSKDIKSSSSKITLIDVCSKFEDGAA
FEATMNIGFTSKNVIDIKDEIKKK
(SEQ ID NO: 139) 

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

SCH9_YEAST (P11792) CAMP-DEPENDENT PROTEIN KINASE SCH9 (EC 35 0.45
O39292 (O39292) COUNTERPART OF HSV-1 GENE UL10 AND VZV GENE 33 1.7
HS78_YEAST (P32589) HEAT SHOCK PROTEIN HOMOLOG SSE1 32 2.3
G435756 (G435756) LYSOSOMAL MEMBRANE GLYCOPROTEIN LAMP-2 HO 32 3.0
LMP2_HUMAN (P13473) LYSOSOME-ASSOCIATED MEMBRANE GLYCOPROTE 31 5.1
AXO1_CHICK (P28685) AXONIN-1 PRECURSOR 31 6.7

Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 711 to 1 of CT517: this
corresponds to nucleotides 179367 to 180077 of the genomic reference sequence.

Hit to public sequence:

gi|6856160|gb|AF173992.1: CT nucleotides 1 to 615 match nucleotides 50 to 664 of the
public sequence with 100% homology, a score of 1219 and an E-value of 0.0

TaqMan Primer/Probe Sets:

5'start=227

5'stop=247

3'start=294

3'stop=311

5'primer=TGGCCATGAAGAATGGTCTCT (residues 227 to 247 of SEQ ID NO: 138)
Tm5=58.19

3'primer=GTCCTTGGGGCTGTGACG (residues 294 to 311 of SEQ ID NO: 138)
Tm3=58.60

probe1=TGCTGATCTTGTCATCTC (residues 273 to 290 of SEQ ID NO: 138)

probe1start=273

probe1Stop=290

direction1=Forward

Tm1=69.00

score1=1.99

length=85

PCT/US00/28888
CT518

Nucleotide 

Genomic coordinates: 

Start: 190743 

Stop: 188172 (SEQ ID NO: 140) 
Amino Acid 

MTESKDYVIALVAETKTDEKRLNYVSEGLVAAISNLQNTPEKQRKVVISSDVFGPTWFNK 

TTEFFNSGLRLAKGHLSKDAVMRSVYRDIEGVREHIIDPSWRLTETAAEELCDFTFLKQA 

PLLNLLNAFENIMDGVFRSAANLVLYSTRGDTNEPSWVIDSEMLANRNNSTVADLAMGRA 

KRAIALFLGYTLCDILRWKQSIASRMKERGLDPFAAMPPHLEYGRAADMIEKRIKDFIEG 

SFSDGVTVSEEDGQSYVVPTISTVLTNMVSVIQEGFYPPKVGSFHEALLGREIMVLLSAA 

IDAEYRAVLSRTRNAKPNPLTTKLDKYVNNPHLQMPSESVTEREKEWVERERERIKTTDM 

TAENLFRDHPYLPKAIDGILGPKRTPTALQALQREYKRCNKFNDIVSPETLEYFLVNNRQ 

VMFSNYSVTRVLDPDSAARFSMYVLWNALFLCSGGLTQKTNSSAVKSRLILQVFLKDMHS 

LFVCQRCESGFITKSLDTFTISLKEQSKPSMGEQELETYWKAVLDALGGGGGNNKGAENV 

NGLGELMVEILSADSGLLRGGGIiGGDIGFEGKMKQKREDEEVRNMHLVDKKGYVFEAAKY

VHVSKGFAALSFYLLYAAAATSNPSITNNFDRAVYLLLARWGDLKFPTHNLWGNVPTDEN 

TSSLLSFASFWALRNAVRARRNVIDASNTSFVPGRPLPIXSAFSSKMLVDNMLKNNYVKV 

ENVNREKLIWKAFREMQTESEIWKTSGSKAASDRNVKKAKQDLIRNASIGRLIVEPVGKT 

PISSIALFRSMKRSRSEDLKMGSNNKYRLARDTKTATPRNPLSYTGKIVFSLDDLKNFSK

DSYTTMKVFPLTPLDG 

(SEQ ID NO: 141)

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

CAA22155 (CAA22155) HYPOTHETICAL 46.4 KD PROTEIN 37 0.49 

O95517 (O95517) DJ1170K4.1 (NOVEL PROTEIN SIMILAR TO KIAA01 35 1.9

Q43688 (Q43688) GLYCINE-RICH PROTEIN (FRAGMENT) 34 4.3

YHO0_YEAST (P38800) HYPOTHETICAL 149.7 KD PROTEIN IN IRE1-K 34 5.6

P97572 (P97572) CALPAIN SMALL SUBUNIT (EC 3.4.22.17) (FRAGM 34 5.6

O48591 (O48591) GTL2 GENE 33 7.3

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 515 to 1 of CT518: this
corresponds to nucleotides 188146 to 188660 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=1308

5'stop=1327

3'start=1383

3'stop=1404

5'primer=AGCTGCCCGATTTTCAATGT (residues 1308 to 1327 of SEQ ID NO: 140)
Tm5=58.26

3'primer=CCTGCTTTTCACTGCAGAGCTA (residues 1383 to 1404 of SEQ ID NO: 140)
Tm3=58.54

probe1=TGTGCTATGGAATGCATT (residues 1329 to 1346 of SEQ ID NO: 140)

probe1start=1329

probe1Stop=1346

direction1=Forward

Tm1=68.99

score1=1.88

length=97 
CT1003

Nucleotide 

Genomic coordinates : 

Start: 23709 

Stop: 24300 (SEQ ID NO: 142) 



Amino Acid 

MDVSSYKSTIDYHNIEDMDDLQRATYKDRMETELVLEMAKKEGRYVRSLATMDELEVPEE 
PATCYTCGYTFIRRRAPPPKRKSIFREPCAYPELLPDAPSPVRLEELVDVPEGASFFTYP 
PYDDGSSTSSSQAECEDDYPPPYDPSENPQRSQVCDYCTTRQVLSSMTDHARANLIKNLK 
REKKALGLGRRNNFSY 
(SEQ ID NO: 143) 



Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

VP40_EBV (P03234) CAPSID PROTEIN P40 (VIRION STRUCTURAL PRO 35 0.46
Q9Y4G2 (Q9Y4G2) KIAA0356 PROTEIN 34 1.0
O88508 (O88508) DNA CYTOSINE-5 METHYLTRANSFERASE 3A 33 1.8
Q9Y6K1 (Q9Y6K1) DNA CYTOSINE METHYLTRANSFERASE 3 ALPHA 33 1.8
Q23804 (Q23804) SPID PRECURSOR (FRAGMENT) 33 1.8
O88799 (O88799) ZONADHESIN 32 3.9

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 736 of CT1003: this
corresponds to nucleotides 23688 to 24423 of the genomic reference sequence. 
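
The EST comment pairs a transcript interval with a genomic interval of the same inclusive length (736 nucleotides for CT1003). Assuming the correspondence is a simple ungapped offset on the forward strand, which the listing does not state explicitly, it can be sketched in Python as follows; `transcript_to_genomic` is a hypothetical helper, not something defined in the source.

```python
def transcript_to_genomic(pos: int, transcript_start: int, genomic_start: int) -> int:
    """Map a 1-based transcript position to a genomic position.

    Assumption: the alignment is ungapped and runs in the forward
    direction, so the two coordinate systems differ by a fixed offset.
    """
    return genomic_start + (pos - transcript_start)


# CT1003: transcript nucleotides 1..736 against genomic 23688..24423.
first = transcript_to_genomic(1, 1, 23688)
last = transcript_to_genomic(736, 1, 23688)
print(first, last)
```

Entries whose transcript interval is written in descending order (for example "711 to 1") would need the opposite sign on the offset; that case is not covered by this sketch.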
CT1004

Nucleotide 

Genomic coordinates: 

Start: 26630 

Stop: 27257 (SEQ ID NO: 144) 



Amino Acid 

MAPNSFQKFAPVIKTEKKEEERDEHDDPLRQIDFRDRKTLICLTANCVSRKRKAGSAHDR 
VYKVLRYGNPYKYRRPNRTHRGLALSMDQGEVGTCLPLRPMEETEENPIDKCGVAFLYSN
YNEGDGMTHLYNDEEYIKKCKTIEGGTRTWVKKNRQEYFRQALETLMMSHSIKQYSNFIF 
FKEDMEEGFVHKLHTFINMVHPKKVSVL 
(SEQ ID NO: 145) 

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

Q9XX10 (Q9XX10) Y39A1A.22 PROTEIN 34 1.1

P70392 (P70392) RAS PROTEIN-SPECIFIC GUANINE NUCLEOTIDE-REL 32 2.5

O75381 (O75381) PEROXISOMAL MEMBRANE ANCHOR PROTEIN HSPEX14 32 4.4

Q40112 (Q40112) HYPOTHETICAL 28.4 KD PROTEIN 32 4.4

ECE1_CAVPO (P97739) ENDOTHELIN-CONVERTING ENZYME 1 (EC 3.4. 31 5.7

O70651 (O70651) GAG POLYPROTEIN 31 5.7



Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 654 of CT1004: this
corresponds to nucleotides 26601 to 27254 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=248

5'stop=266

3'start=291

3'stop=309

5'primer=TGGCCCTCTCAATGGATCA (residues 248 to 266 of SEQ ID NO: 144)
Tm5=58.53

3'primer=CTCTTCCATGGGTCGCAGA (residues 291 to 309 of SEQ ID NO: 144)
Tm3=58.32

probe1=AAGTAGGAACATGCCTCC (residues 272 to 289 of SEQ ID NO: 144)

probe1start=272

probe1Stop=289

direction1=Forward

Tm1=68.97

score1=1.97

length=62 
CT1005

Nucleotide 

Genomic coordinates: 

Start: 31091 

Stop: 31961 (SEQ ID NO: 146) 
Amino Acid 

MEGEHQYLNLVREILERGVKKDDRTGTGTLSIFGPQMRFSLRDDTIPVLTTKKIFWRGVV 
EELLWFIRGNTDAKEIAKKKIHIWNANGSREFLDSRGLYDRAEGDLGPVYGFQWRHFGAE 
YDTCSSDYTGKGIDQLANILKTLRENPDDRRMIMTAWNPMDLHLMALPPCHMTAQFYVAN 
GELSCQLYQRSGDVGLGVPFNIASYSLLTHLMASMVGLKPGEFILTLGDAHIYNTHIEVL 
KKQLCRVPRPFPKLRILMAPEKIEDFTIDMFYLEGYQPHSGNLQMKMAV
(SEQ ID NO: 147) 

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

TYSY_HUMAN (P04818) THYMIDYLATE SYNTHASE (EC 2.1.1.45) (TS) 410 e-114

TYSY_MOUSE (P07607) THYMIDYLATE SYNTHASE (EC 2.1.1.45) (TS) 407 e-113

TYSY_RAT (P45352) THYMIDYLATE SYNTHASE (EC 2.1.1.45) (TS) 406 e-112

Q89940 (Q89940) THYMIDYLATE SYNTHASE 390 e-108

P90463 (P90463) ORF 70 387 e-107

DRTS_TRYBB (Q27783) BIFUNCTIONAL DIHYDROFOLATE REDUCTASE-TH 381 e-105



Comments:

TaqMan Primer/Probe Sets:

5'start=424

5'stop=446

3'start=479

3'stop=501

5'primer=ACCCTGAGAGAAAATCCAGATGA (residues 424 to 446 of SEQ ID NO: 146)
Tm5=57.82

3'primer=AAGAGCCATAAGGTGAAGATCCA (residues 479 to 501 of SEQ ID NO: 146)
Tm3=57.91

probe1=ATGACGGCATGGAATCCT (residues 460 to 477 of SEQ ID NO: 146)

probe1start=460

probe1Stop=477

direction1=Reverse

Tm1=68.98

score1=1.98

length=78 
CT1006

Nucleotide 

Genomic coordinates: 

Start: 32124 

Stop: 32802 (SEQ ID NO: 148) 
Amino Acid 

MAFNFEDSTNLFANMDLTAGTTTDPTRPNIIFFESLLPNSGIEVMKRRLVRQGKCGNFEA 
SGGAMSYFWLEDNAEDMENLNSGSHVKTNCLALFLQEFISNWIEETDRHGQYCTFPQYMD 
GGDGSRGGYFTSLAMKWMARDVTFFVFVDRNNTVENAASIWMYQKLLAIGAKVVKVIVDN 
ASNPMFSVCNACRCKYPGPVSYVIEGHGVGHSDLTCDEISGFFV 
(SEQ ID NO: 149) 

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 624 of CT1006: this
corresponds to nucleotides 32182 to 32805 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=268

5'stop=289

3'start=340

3'stop=361

5'primer=TGCTTGGCATTATTCCTTCAAG (residues 268 to 289 of SEQ ID NO: 148)
Tm5=58.01

3'primer=CGTCCATGTATTGGGGAAAAGT (residues 340 to 361 of SEQ ID NO: 148)
Tm3=58.72

probe1=CGACATGGACAGTACTGT (residues 322 to 339 of SEQ ID NO: 148)

probe1start=322

probe1Stop=339

direction1=Forward

Tm1=69.04

score1=1.95

length=94
CT1007

Nucleotide 

Genomic coordinates : 

Start: 32947 

Stop: 34216 (SEQ ID NO: 150)



Amino Acid 

MDSNTSILPPSKRPGLNLLQVLGIIITVALIASVSSFIFYRVGKRKYYPSSSSSSELSDV
DNGVEGGGGTTTTPTQPSPDGGDGYVDLSPQKKAELRTRVANVIFQEVSKDQGVAFRRAM 
NDSTDKIMEETEARINNFSEPFREATVEREVFKDDTDKNFILSTLDLTEEQFKDIVMAEV 
KNQLENFDYEDMTRLIFDNIPETDYLWTTHFDPKKYDTYSEKVLGFSDINSIERISSTFY 
KGKKYEVTTGNVAVLVDFESETIKEKAGNSLIRNVEFIVVDEQTYKSFFPAFNQVFFSFK 
VNKEKREVTVSINNGCVGIVANITPLTTPVGAASGHYIYGTSTAKEKTYLFVIDKYDTTE 
FVCGLSNKSTPLMALNILFMSDTVFPSFDEAERPLTDAKAVEILGKRLGVGRYTNANIRN 
TQ 

(SEQ ID NO: 151) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q56711 (Q56711) HOOK-ASSOCIATED PROTEIN TYPE 3 39 0.045

O77363 (O77363) MAL3P4.5B PROTEIN 39 0.059

YLM5_CAEEL (P34379) HYPOTHETICAL 49.8 KD PROTEIN D2007.5 IN 37 0.23

GBA2_CAEEL (P22454) GUANINE NUCLEOTIDE-BINDING PROTEIN ALPH 36 0.51

Q9YW03 (Q9YW03) ORF MSV089 PUTATIVE NTPASE, RABBIT FIBROMA 36 0.51 

P87199 (P87199) KINESIN MOTOR PROTEIN 36 0.51 



Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 907 of CT1007: this
corresponds to nucleotides 33238 to 34144 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=648

5'stop=674

3'start=773

3'stop=799

5'primer=TGACACGTACTCTGAAAAGGTATTAGG (residues 648 to 674 of SEQ ID NO: 150)
Tm5=58.05

3'primer=CCTTCTCTTTTATTGTTTCAGATTCAA (residues 773 to 799 of SEQ ID NO: 150)
Tm3=57.41

probe1=ATGTAGCTGTCCTCGTTG (residues 752 to 769 of SEQ ID NO: 150)

probe1start=752

probe1Stop=769

direction1=Forward

Tm1=68.89

score1=1.89

length=152 
CT1008
Nucleotide 

Genomic coordinates: 
Start: 34217 

Stop: 35048 (SEQ ID NO: 152) 



Amino Acid 

MEGVILDKIETIAKRASPSYGSIDVGTAILRRQFMEKIRGKINEETTMEKIMGTKEERED 
TIRSIVANVIKENTVKENVTEKIRAMTDKELNDNREFMHDFGKISTGDGGTFHLFEDTPG
FESALKAEYKNVPGATTPKYVSMNSLRIDAINGKIEEVYNPSPIMGIREYGTIRRGRYEE
NAGSKELVFMTKIEKRPNNVAENLIIRVANQQYNVMRMVFFIDYETKKGVSKEEMFIPYN
VQKTKALKGRSTYFSFVRKIPDEPEGSIIIHALGFY
(SEQ ID NO: 153) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

BAG_STRAG (P27951) IGA FC RECEPTOR PRECURSOR (BETA ANTIGEN) 41 0.007 

Q99051 (Q99051) IMMUNOGLOBULIN ALPHA FC RECEPTOR PRECURSOR 41 0.007 

YIBA_ECOLI (P24172) HYPOTHETICAL 31.8 KD PROTEIN IN RHSA-MT 38 0.079 

Q46749 (Q46749) ORF-A1 38 0.079 

Q25920 (Q25920) MATURE-PARASITE-INFECTED ERYTHROCYTE SURFAC 35 0.53

Q06166 (Q06166) MATURE PARASITE-INFECTED ERYTHROCYTE SURFAC 35 0.53



Comments:

TaqMan Primer/Probe Sets:

5'start=361

5'stop=384

3'start=429

3'stop=451

5'primer=TTTGAAAGTGCTTTAAAGGCAGAA (residues 361 to 384 of SEQ ID NO: 152)
Tm5=58.35

3'primer=TCGCATCGATACGTAAACTGTTC (residues 429 to 451 of SEQ ID NO: 152)
Tm3=57.96

probe1=CCAGGAGCAACTACTCCA (residues 397 to 414 of SEQ ID NO: 152)

probe1start=397

probe1Stop=414

direction1=Reverse

Tm1=69.05

score1=1.94

length=91 
CT1080
Nucleotide 
Genomic coordinates: 
Start: 291719 

Stop: 292205 (SEQ ID NO: 154) 
Amino Acid 

MTSPAPSPSSTPKSSCTTIVNRCGFLLDNNKEVVIYDTNSKFKCEPKNLELIGVLSGVSD 
NVVTQISPDQIFVGTYMVKYNWSKSGHERFSDMSNNCLDNITRPSEVIESVIKKTSSDFK
MKYTRSLMDHTEKYYFSGDQKLSKISSWCTTPIRQWVCNSV 
(SEQ ID NO: 155) 

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

CAB45519 (CAB45519) GIBBERELLIN 20-OXIDASE-ARABIDOPSIS THAL 32 3.0
Q52743 (Q52743) CELB 32 3.0
Q39110 (Q39110) GIBBERELLIN 20-OXIDASE 32 3.0
O60963 (O60963) L549.2 31 3.9
FLPA_ARCFU (O28192) FIBRILLARIN-LIKE PRE-RRNA PROCESSING PR 31 5.1
DPOE_YEAST (P21951) DNA POLYMERASE EPSILON, CATALYTIC SUBUN 31 6.7

Comments:

TaqMan Primer/Probe Sets:
5'start=201
5'stop=220
3'start=290
3'stop=312

5'primer=CCCCGACCAGATATTTGTGG (residues 201 to 220 of SEQ ID NO: 154)
Tm5=58.70

3'primer=AGGGCGTGTAATATTGTCCAGAC (residues 290 to 312 of SEQ ID NO: 154)
Tm3=58.09

probe1=CGCTTCAGTGACATGAGT (residues 265 to 282 of SEQ ID NO: 154)

probe1start=265

probe1Stop=282

direction1=Reverse

Tm1=68.90

score1=1.90

length=112
CT1009
Nucleotide 

Genomic coordinates: 

Start: 35073 

Stop: 35967 (SEQ ID NO: 156)



Amino Acid 

MALQEKDITIGNVSAALRELMYSPTHMQHHDKLNTFLDRNVESSSEEKIRQIVDKIRSQT 
TSDISETVNNVTTNGTAFSLFEDTLEGMVKKNIGDNLQSGDFIDGRKKLNDMKSLATGAI 
LSRQRDFVAESITGTKDWLKAIMGCGIIRYTVFVNNLARSTLDNDDDKAATYYNTPIYGG 
YCKMAIKDYEIPDSYSKVEAEHTVEGRKMTFNIKWRGDTINNLITIIPSVTGYLASISED 
ADVQAPLLLNCNNCFIEADMSSLYMDEKKTEASFTLNLPEIEGADANAVYEICIVVV 
(SEQ ID NO: 157) 

Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

YB75_YEAST (P38321) HYPOTHETICAL 101.2 KD PROTEIN IN FAT2-P 34 1.3

Q08281 (Q08281) CHROMOSOME XV READING FRAME ORF YOL138C 33 2.9

Q92271 (Q92271) 12.8 KBP FRAGMENT OF THE LEFT ARM OF CHROMO 33 2.9

Q26032 (Q26032) VARIANT-SPECIFIC SURFACE PROTEIN 32 3.8

O02179 (O02179) CALYX PROTEIN 31 8.6

O01394 (O01394) POLYHEDRAL CALYX PROTEIN 31 8.6

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 780 of CT1009: this
corresponds to nucleotides 35166 to 35945 of the genomic reference sequence. 
CT1081

Nucleotide 

Genomic coordinates: 

Start: 292189 

Stop: 298777 (SEQ ID NO: 158) 



Amino Acid 

MQLRLENFVKEEHETVVVHNPSGMTGFNIFNSSPVYFEVHNEMDALIFMAAFLKHNSLWG 
EINANMDLYTFDYAGAFLDERWCHHEKSFSVVRAQLINSYYKCRRKIMQALDNNYNNKNK 
KRKNVGGAPAFTFMSGDGEGGKEALEASFDVIGGTRGGRFGVDSTPCPHSSAMQLKLDNE 
GNYGCIACFASMFFVLENPGDESSFISTDASKIGQAQAWIDERLRNNENGGEENNVFKKT 
FHMLADITQKAHETAYSNTIPLGPNGRQWNWPTHTVEPIAHEFVTHSLVNTLKNLGDRKL 
PRFNFDILYNLLNPFGKMLLVFIQNCHILTGHKNNENVVPRGSASGKWWTINFVGVNMWT 
FQVTKCKVEKDRKISDLACMETLPRLPNPGSTTVDDRIVFKGFCRGENLGSVGEVVSDIT 
QSVKNFCLMVENRKFSVDKETGFISSESIVSDPFFSLEVTGCRSNRAQDTINNGRVSARV 
MRILKSREGARVWLAKDENAIIFENVNHDTAISTDAMERAIGQHKILYYDIETTDKDFTD
KKSVITSIGFCLCTGGDMTHGGERGVFGLVAPGSDVEKVKETIINSYDPEEKEDIMKQCP
QVIEIFTNEFEMLLGFGKYIDKVKPHVISGWNNVAFDDPFVFTRIVKHLSDHTKDMSYCV 
ADASTAESVLPRATEGGGGGETPYRLSTPQERIQLASTGIFNKLGKFVDKKTGMLKPEMT 
ADLLAGAESQANTKFKERNKLSSSNKGSAGWFQKIIGGMCSAIRLDLMKVCEKAYKESLS 
EFNLNAVLAKVSSVGDKVKNVKDEVDLHFHLLGFLKLKKAQDQAKVHVYCCKDAYLTGIV 
STSINKEGEIFRLCMDSALTEAVVTANLATPLCIGEGAICRNMGEERADRRGVGVRRHSI 
ATDTKGGMVSQPIVNHVPYQTIDMTSLYPMTMCQNNLCTTTFVTHRQIMQLRDRLVLEKM
KNKTTDSLLLLDVIDECNQIVLSEYRPIDIAVASWKNSNSNRQTPITRIEESLGLRFIEN 
LDAEKTNNKTWCTNTSPNMNVTAAGMDYFPEIVCDINMQFAAKVNDDMHIAPASLEYMLQ 
VLPIMLIDRPYIGAHITAGKCRTLEDILSELEKDFSVEKDEEIIRTHWTFKGQKQYDFCH 
SPVTQMARHIIESTGRNIRDYEGNEKFERLVSLSDRIYRRVGAFDSANDPAVRLWSSRLI
NVGMLVRTWNVKTDILKGIIPQMQATYRADRVVMQNKAKEFAKMGDMKRAGLNKVGQNIM
KLGMNSMYGHLALRARSSRKEFASGSANTASSISNMSATGGIGGGTRHSVTANQITENAR 
CVFGNIGCGLQMALPGTKQTYGDTDSVFCVHNIVGDGGMIPEYDEQTGKYYYVMDIALKN 
KMAAIIPILVNSLTKGIQFVERRDAGVGMMNIAHERLAVAGLLFAKKTYHMLHFNENSAA 
FNDMIKLKSTDNNNKFASFIKRPSHADGYVVPHNPSLILRAAEGPAGKKLKSFLEEEGIH 
DEKSMEEWFTSSPTWMAMDASVINNLYASQIVGVEKGNWIDAMTSRPIEAGTEMMEAVTQ 
ANAAFTPYKKGAFVKKGITPTTKLKGLQSLIARFLPKIEEKKSCYLDVMKNHVENFASHI
TNPAMMITSSRVNKFDTSKEQSRPNPLALAINNHLNPSSEISLGQKFKTVTSVSSWSLSA 
EEGEVPAGYFNAGSVRWDATNMKGSVPAFSVKNLSVVPNAITSVYKMVESDKTAIKSMIA 
KNVEVLCSTSANTGFSLRRGALSFNTGVIVTKDVAMACIRSLNNKQMLLFVGGGKDYGED 
DDDDDEEAEEEDEENGENEENKGDCVTEKKIPGRSTNKDVGEETKTSEKTEGERKGSKTA
KGKTEEIASSLSKCGKKDARDVILDRLLKATHSSCTNNEERTRVLQQYSNCTLSSYITSV
MKLDQRVADQMENHSQLDQIRNLSNKKRQEKGGPFKSELDAMVAAVKVKFFPVLDASRK
LTQDHWKKCPVSIPETREEKPLMGVPFEVALNSLIGKHKCTDTCDMACCQSLYFVLLYTL
ALKFENERLARQIGLDDSVDLMAEMLFGGDKLLAQEVLKRVKDAQDRKLVKSLLPLNYNH 
DTNTIIFLFESLRFAQKPVAGMSVSEIKDAVRGLAFSTTTGTVWNYTDERFFGPLYNMDE
LCNERVNGNCKLSFITGIYHTAAVELAAACLSCVL
(SEQ ID NO: 159)



Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

DPOD_YEAST (P15436) DNA POLYMERASE DELTA LARGE CHAIN (EC 2. 52 5e-05
DPOD_SOYBN (O48901) DNA POLYMERASE DELTA CATALYTIC CHAIN (E 51 8e-05
DPOD_SCHPO (P30316) DNA POLYMERASE DELTA LARGE CHAIN (EC 2. 50 2e-04
CAB58156 (CAB58156) DNA POLYMERASE DELTA LARGE CHAIN 50 2e-04
UBF1_RAT (P25977) NUCLEOLAR TRANSCRIPTION FACTOR 1 (UPSTREA 50 2e-04
UBF1_HUMAN (P17480) NUCLEOLAR TRANSCRIPTION FACTOR 1 (UPSTR 49 4e-04

Comments:

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 4 to 1205 of CT1081: this
corresponds to nucleotides 297608 to 298809 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=2738

5'stop=2760

3'start=2817

3'stop=2838

5'primer=TCGTCAATCATGTTCCCTATCAA (residues 2738 to 2760 of SEQ ID NO: 158)
Tm5=58.05

3'primer=TCGATGGGTCACAAAGGTAGTG (residues 2817 to 2838 of SEQ ID NO: 158)
Tm3=58.56

probe1=GATGACCATGTGTCAGAA (residues 2787 to 2804 of SEQ ID NO: 158)

probe1start=2787

probe1Stop=2804

direction1=Reverse

Tm1=69.00

score1=1.99

length=101 
CT1082

Nucleotide 

Genomic coordinates: 

Start: 300935 

Stop: 305108 (SEQ ID NO: 160) 



Amino Acid 

MTEQGDQGIKVRKLHGPRGERGETGPAGAVGPAGPQGERGAIGPAGKDGAVGPAGPQGER 

GAIGPAGKDGAVGPQGPPGERGENGRPGRDGAVGPQGERGAIGPAGKDGAVGPQGERGAI 

GPAGKDGAVGPAGPQGERGENGRPGRDGAVGPAGPPGERGAIGPAGRDGAVGPAGPPGER 

GATGIPGRDGVDGSVGPQGERGEIGRPGRDGAVGPAGPQGRRGATGRAGKDGAVGPAGPQ 

GEKGEAGKDGSIGPQGIQGPRGETGPPGRDGTAAERGERGFPGPPGETGPPGKDGVDGSE 

GPQGKRGETGPVGPRGEPGLAGLPGRDGAIGPAGPPGERGATGLPGRNGVDGSIGPQGRR 

GATGRAGKDGAVGPAGPPGERGATGIPGRDGVDGSVGPPGERGETGPAGRDGSVGPAGPH 

GERGENGRPGRDGATGPIGPAGPQGEKGENGRPGRDGATGPIGPRGETGAMGKNGVDGSM 

GPQGRRGATGRAGKDGAVGPAGPPGERGETGPAGRDGSVGPAGPQGETGLTGSPGRDGAT 

GPIGPAGPQGEKGENGRPGRDGATGPIGPAGPQGEKGENGRPGRDGATGPIGPAGPQGET 

GLTGRPGRDGATGPIGPRGETGAMGKNGVDGSTGPQGRRGATGRAGKDGAVGPAGPPGER 

GENGRPGRDGATGPIGPAGPQGETGLAGLPGRDGAIGPQGEKGENGRPGKDGATGPMGPP 

GERGETGPIGPAGPQGATGLPGRDGVDGSVGPQGKRGLIGRTGRDGAIGPVGPAGPKGET 

GLAGLPGIDGKDGSVGPQGAIGPIGPRGERGETGRPGRDGEDGSTGPMGPQGLRGATGAP 

GPQGERGLKGRPGKDGETGPPGRQGRDGIMGPRGLRGEKGAPGNDGLEGPEGRDGAPGPA 

GPIGPQGIRGLKGIQGRPGRDGEMGPAGKDGIEGPRGQDGTTGAKGPRGLRGFQGRTGET 

GAQGSRGEKGDRGLTGPQGRDGPPGEEGPQGLRGERGAPGPRGPRGIRGRSGPQGSNGVQ 

GPRGPRGTKGRTGIQGLTGIEGPRGPRGIQGKEGRMGKIGHRGEKGDKGDRGEQGIAGAD 

GEKGPRGLRGIRGPIGAPGKPGTEGVRGPRGVRGVPGYPGAQGELGPQGPTGPQGPAGPQ 

GPMGRTGDTGPMGPPGAVGPRGEKGGRGRKGKNGPKGADGKDAVNIIQKYSITHARAEIM 

WEGNEIGEAYIGRSYGTDTIPVMIENRIGMTNEDKKNEYCIQVMTMHSITTRGRTSGVFV 

VSNKTDYILLVTLLMPESVSCRTDVSTNARSERVNAVRERESKSYRFIRPSDQSIGTHSR 

SKIAVVMYPDASMSYSVDTLDADVARRETTSVLLLAETIHGEKDRGFYADRGTVGRLMVP 

PTEEELLVLQX 

(SEQ ID NO: 161)

Top Blast Hits 

Score E

Sequences producing significant alignments: (bits) Value

Q14054 (Q14054) COLLAGEN TYPE VII PRECURSOR 940 0.0

CA11_CHICK (P02457) COLLAGEN ALPHA 1(I) CHAIN PRECURSOR 935 0.0

CA17_HUMAN (Q02388) COLLAGEN ALPHA 1(VII) CHAIN PRECURSOR ( 933 0.0

Q63870 (Q63870) TYPE VII COLLAGEN 928 0.0

Q60444 (Q60444) TYPE VII COLLAGEN (FRAGMENT) 925 0.0

CA11_HUMAN (P02452) COLLAGEN ALPHA 1(I) CHAIN PRECURSOR 923 0.0

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 557 of CT1082: this
corresponds to nucleotides 304552 to 305108 of the genomic reference sequence. 
CT1083
Nucleotide 
Genomic coordinates: 
Start: 50094 

Stop: 50295 (SEQ ID NO: 162)
Amino Acid 

MAVTEIPCGTRNIAEEDVELELILVTAEAEVREAMAAALAAAIIGAVVVQIGRVLDEVVA
AEVELM 

(SEQ ID NO: 163) 
Top Blast Hits

Score E

Sequences producing significant alignments: (bits) Value

LIPB_MYCTU (Q10404) PROBABLE LIPOATE-PROTEIN LIGASE B (EC 6 30 1.8
Q64033 (Q64033) ANTIGEN LEC-A 29 3.1
OL56_STRAT (Q07017) OLEANDOMYCIN POLYKETIDE SYNTHASE, MODUL 29 5.2
RPSD_PSEFL (P52326) RNA POLYMERASE SIGMA FACTOR RPOD (SIGMA 28 6.9
O54540 (O54540) RNA POLYMERASE SIGMA FACTOR 28 6.9
Q9YAS2 (Q9YAS2) 136AA LONG HYPOTHETICAL PROTEIN 28 9.0

Comments:

TaqMan Primer/Probe Sets:
5'start=24
5'stop=45
3'start=95
3'stop=112

5'primer=TGGTACTCGGAACATTGCAGAA (residues 24 to 45 of SEQ ID NO: 162)
Tm5=59.21

3'primer=CTGCTGCCATCGCCTCTC (residues 95 to 112 of SEQ ID NO: 162)
Tm3=59.19

probe1=TTGTAACAGCAGAAGCAG (residues 71 to 88 of SEQ ID NO: 162)

probe1start=71

probe1Stop=88

direction1=Reverse

Tm1=68.98

score1=1.92

length=89
CT600

Nucleotide 

Genomic coordinates : 
Start: 1118 

Stop: 491 (SEQ ID NO: 164) 



Amino Acid 

MHMWGVYAAILAGLTLILWISIVVTNIELNKKLDKKDKDAYPVESEIINLTINGVARGN 
HFNFVNGTLQTRNYGKVYVAGQGTSDSELVKKKGDIILTSLLGDGDHTLNVNKAESKELE 
LYARVYNNTKRDITVDSVSLSPGLNATGREFSANKFVLYFKPTVLKKNRINTLVFGATFD 
EDIDDTNRHYLLSMRFSPGNDLFKVGEK 
(SEQ ID NO: 165) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

O51578 (O51578) EXODEOXYRIBONUCLEASE V, BETA CHAIN (RECB) 39 0.033

O68195 (O68195) DIOL DEHYDRATASE-REACTIVATING FACTOR LARGE 32 3.2

BACC_BACLI (O68008) BACITRACIN SYNTHETASE 3 (BA3) [INCLUDES 31 5.5

TKT_MYCPN (P75611) TRANSKETOLASE (EC 2.2.1.1) (TK) 31 5.5

Q9ZER8 (Q9ZER8) REPAC1 PROTEIN 31 7.2 

CAA10001 (CAA10001) REPAC PROTEIN (FRAGMENT) 31 7.2 



Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 569 of CT600: this
corresponds to nucleotides 513 to 1081 of the genomic reference sequence. 
CT520

Nucleotide 

Genomic coordinates: 

Start: 209342 

Stop: 196799 (SEQ ID NO: 166)
Amino Acid 

MFKANVLNLGGGKFLESDVRDHLIKCANQMKEEPTTLRICLSNKLPEYDNRRLPLLLLNE 
GEQILVTDNLTKNGNPLVKQMGHLAVQDRVGGDGSVNPNNLLYAGCNVVEYDTVNRGNDG 
KLIMYSQPATLKDVAKSECKKGMYKVKKVPEITGDQFLDKLNERSCQNENRRMDEEGPHVG 
TGKLLRELIIMMRLYEEETSSAEKLCVTPAFREFLGCGRTATDVPVFKVAFITNASLMGL 
KVIFYPTILEEERLAAVSDTENVVLLKSILKVQLELLSECMPRIVERVESMIKKTVACFK 
IDIGGSDNWNLPGHCKVSDTAFPYHHAQLVGEKKNTLSISNENMVTSLGVVKADRAEEWM
CKTLESFEKKCLYLENLMGSMANTDDWRRKILFSELGPEMPYRNKSLIMDQDFCTIGMCY 
KFLAEGGGLLLTKTNATLLKEKMACKGLDDSGDGDDEEEDNEEGGSGGKSGGGSGDENNI 
NKPPPAPKQIPPLAANVYNSIINDDDKLDQIVCFFKRKHGFFLSDIDNSPLLAMEFLLPQ 
KAMSKKNCVERVKPETKNIIRNLTGVNTIKFDTIMPFAILQIVVRYENRNLKLPRDTDIL
QQRLKNNTWDALSKGKFAEMWQFTHKESLKPPTIEELESIPPPPTQSEEEAAAAAAASTS 
STTPDMVSSLEEGATSTSSSDENQIASLENIKKLLSIITSTFATGADPCNDTIFAWTVVTL
AERFCALYNITSHPEEYYQQIIREDFEFEGGFEKFRHMCDAINRELSIYVPKSVLEKQSV 
CRMGVAAYENSMERIKNKTNSKLCKIKYDESTMVYELNNDTFKTFDYDESDKSFGPMYEC
APMETFQRLFASVKSDKEAVLADKKSEKREKLYQQKQEYLRKCDNDDVSARQILNNVASN 
ESDEESDEESDDEENYGAAKGGATGDYYGGDDEDDCYGFLGEFGSSDDENVPSDNASSIN 
NVQDDVFRDVNFIKTFNFRSSLCHRQKYVSTVIVEEMEKNLCDVLTLDNSAAESGDILKE
INRRSLRMRNWVVPFTMPVREIVKPNVNSEDGTANSNNNIPPFCSCASLNNFKSDSPLSS 
NNTMSNEKCIKLLPIPSSKHLKDLTVALRFNTMACERRYFSDVTAALGFVKKDKVNGNIR
SILDNKRWDAIKQCKLAGKCLSSALPLGIYENVISEDNKLINTFRPRSLARLACSSGGDG 
VSDKSVNNGFFSGIWALCANQDLESVVLGSTVVDPLKPTKVFNQSLSEKELKEKRQQMCL 
DAANYFKDHNVSKLNIYECFKMMEECIMRTALNGKTSNDSEFFSNLITRYGSGTNSPASR 
LWTILETVRECFNNSLPIDWGSLVKDWDGSDMLNLKAGVSNVDESGAVFELSEFLGVSAR
AFFGKDLDTNLDADTWECLLNDDNKDWKAQVAKAYEFALKDNDIRSVENFINSSNLLTNN 
NVIKKLKIKPTPSNDVRHQIWVEDEYYPRNKSTLRSRAEWMAATEEVLKTEMSLSCVLAM 
VAMYRIMMQGESVREIATAPLRLSVDKMVPLIRCFKITSKWCSCTGKGDSPKKADASIKE 
GRFYDIEEDPLHFYRFAAYVIGQVASNDIVIEEMTRKILMSFDFNGFDTSNWLQFITYRF 
SHVLMGRRSRLLSRPLSLVKNLVSVSSLADKNSEKSNDMYEKRVGKVMKRIARLVLVKAA 
DSVRASSNDLLDCCILDVNDVSVKSLDEFRAKTRQELQETRIDTNYNLVSNSCTTAQLAA 
VEKSSRIINTNISFHNIPAGQAKVMDANEEAFIDPSLEEINKEDNSGAKQMTGKGGSNRG 
RSKKSGGGGFNNAGGFYNDDSSRGSSSVVDEDSRSRTGFSQIHMDARNEEDRESGLFSYD 
GYVLNRIKNMITQNQINNDIVKVISDIENFFKICVPFSKKEYALYGVTETALSAGMDAIE 
RWNKAVEEETNKIRKECRDLTDTGSVYDMNIICPGDYMSSVGEGGNGGCGGGSSSSGHLL
SNNNNEANQTNEISEDQLKHEGSDCSFWFNFYKKVVNSSEKKQGKGSVLANTGHEGRIVG 
RPLRTFIQYKGKGFAETKVLTRYFSNHISDSYWSQVMPICYIKNMALGDEDKSKKKFGKR
PWKNFNNNSNSSSNSSVKYVSIQDLEKKDSLKNVPMGYDEDLLSLYDDSLTTSTEKLENI 
KIVNDSKDAYVILGSSNQSSFDQTFSQQYFTHQKISNINTYKSLGKMWNCNNGMSPKNQI 
VLLKKLLFKNLNILWIKLYERHISVLCNWGCIHPNSSKNSHFEMTKNNAPCGVTDSNPPL
SVYHSGFLSVEDYGQLLKDTFPLMNLHRTFSAKSKDNNSSDPSPEKISAASLAKAVYARE 
VLSSCLDPEGNFCTSWITNSCSVLFTPGTNIRRGGDFFNKSCYRQQDNDYCFIGKEETKK 
CPNFVSSEIEIVSILKTAVFLSTNSDGHKRVLRVINYNKDHSGLYAGIDTGCADDEDDDD 
DQGGTDKTCLLQEDSMDAKRMLISMRSVINGKSLDESSLAIKKDNFNFLAGTDKGFYLDN 
SFFNSPVQGKFVAPRGTKIFKKCCDFLLNKGTGGVFARIFFTDWACIVSSSKGKNNKKAI
ESTLQIRNGGCFSERLTPSMFDNESEQGELFHDRYCPDFLSDYNKQNIFSEQAYKCSFLA 
NPVCPAKNMLKRAKNIRLCITNAGTALISKIMAEVEKMGNARTFISNGTAIPFRLAENTA 
CISVDNNRYFLIDGTYLLGGRLEGINLVTDMYTRCKLKAEKHVILNSLFSTEFISAALAS 
SMEGTTMGRGLCLIEHVSYMKNTDSVSNMNKNFWSMAEDQEETDENEDDDDENEEDEDEN 
EENTENTSWKYEPVSKTAFSSSLKPPSIFIADEDYIFLSILYELAKATSDCETASSSSS
SSSSSSSSSKHSSSSSSSNKKRKQKDDVNSTTTALHALRKCYISCVDQKTGMPRMDVVYL 
LRGLMNFGGMCTAIASGDGEKAHHMVQTLCSVALNIATKTAVVFVGTKGNNLKTTLVDLC 
KRTWFERFTNINVTALNNAGDSSSSTQANLASFAGKKGIVIIDEVGHQGSFGSKKSSSED
DKDESASRSGNVDFGGSGGEMNSVDINEARNAYGDGGNSKIVFSNINRLMTESKLKVCDQ 
EYDFISELKHEKNRKNACNDTKKRKRGGEIEDEGVECEEIERNDGKNDENGVRIKDPINI
SFFARKAHWWNCSSGVVSTTFKEKNIVYNMLHRGAMPFSIKDCTDSPWLNETDAVYRHCK 
KPIEYEGKFSKSEVKTALKCILGKFGSKICDNESFESIIDENCQVNNLHSWNDCKEDIDE
WNEKFMSKNKKKKQNMKIEDKVDAIMNIIQKNNGLLKWNTSFDRDGSPVLVCNPATERFS

EMITS SLSAQDMLEIKKYLGDNCLSTNGGVKKS VI DGNTSAPGVLIAYHCVYTGKISDDL 

SKTNYPVLLPPPKKQHFVAVDDAAEKALLGPTLSNINIDSIRNIKTISRKLSSIIKDPEA

AKLLVDRDLDFMNMYERYDASLFDVVKKPSKYSFPGFTSDGSVVLSTSTSDCENVLSCLK 

KRIEKDKMSAKNSGSFIRMCMDKNLLSDEKDDSSSNSSKNTSSLPKTDDNSSDIANFLSV 

FGENRQQSSQFSFASNSSGGGDSNKEACFNVDTPKRRQLVSALQKHNSDGSSSIITEIAK 

AIPQKNDVSSSITKHMLPGQFPSSLLKNMTSPQNSVMIRGIFQQGAKSSITVSPIMMSNS

YIFSFFVDEAMSKRLIVFPCDTTFVFENKNEDVKKIIGLLDRGMKYIHSSLMMERCIKFG 

KHGIKQRQHEFNHHKKAWNDFSGHSSDNKKKDRISDVSSVLPSVLMKNLIRNKVLELRDV 

KSVSRLEENTNTFFHLYTSMSLCAKAATNYGESSSSSATITEVEEDNSCDAEEQQLRRKK 

PANYESMCNKLPSPLQMCQINPKSLNTMAMNIARSRQGAWAQLNSMLNSVLFVEMPFVKT 

TRFFGRDFNIKMHSPATKNRPAINFDNCIGMSLPNPDMDVVGYDKEGELIGVGSSLTKHL 

CDAWGSMDVRDLMYSCHHLHMLFEMALQYTECKRRLSSLKTLKSDKTGVDYVAVMLACMV 

YQLMVSNLKYPVFLSSSSHKRANTEDIADENQVSSLSVPMFLAMVVNKPLHALRHSTNLA 

LPNASQKSDHSDIVKYIVMNQWGLRLNPDYLCPNCVKHVL 

(SEQ ID NO: 167) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

SR40_YEAST (P32583) SUPPRESSOR PROTEIN SRP40 54 2e-05 

Q9Z1T1 (Q9Z1T1) AP-3 COMPLEX BETA3A SUBUNIT 54 2e-05 

AAD56625 (AAD56625) NUCLEOLIN-RELATED PROTEIN NRP 52 7e-05 

P90493 (P90493) HERPES SIMPLEX VIRUS TYPE 2 (STRAIN HG52) , 52 9e-05 

P70475 (P70475) NEURAL ZINC FINGER TRANSCRIPTION FACTOR 1 ( 50 3e-04

O95815 (O95815) DENTIN PHOSPHORYN (FRAGMENT) 50 3e-04



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1064 to 1 of CT520; this
corresponds to nucleotides 196778 to 197841 of the genomic reference sequence. 
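
The EST comment pairs a transcript interval with a genomic interval. A minimal Python sketch, assuming both intervals are 1-based and inclusive (the listing does not state its convention, so this is an assumption), checks that the two intervals cover the same number of nucleotides. The helper name `span_length` is hypothetical.

```python
def span_length(start: int, stop: int) -> int:
    """Number of positions in an inclusive interval, in either orientation."""
    return abs(stop - start) + 1


# CT520: transcript nucleotides 1064 to 1, genomic nucleotides 196778 to 197841.
transcript_len = span_length(1064, 1)
genomic_len = span_length(196778, 197841)

# The two intervals should describe the same stretch of sequence.
spans_agree = transcript_len == genomic_len
```

A descending transcript interval (1064 to 1) paired with an ascending genomic interval may indicate that the transcript lies on the opposite strand; the sketch only compares lengths and does not test orientation.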

TaqMan Primer/Probe Sets:

5'start=7375
5'stop=7397
3'start=7450
3'stop=7472

5'primer=GATGACGATCAAGGTGGTACAGA (residues 7375 to 7397 of SEQ ID NO: 166)
Tm5=57.66

3'primer=CCATTAATGACAGACCGCATAGA (residues 7450 to 7472 of SEQ ID NO: 166)
Tm3=57.78

probe1=CAATGGATGCTAAGAGGA (residues 7424 to 7441 of SEQ ID NO: 166)

probe1start=7424

probe1Stop=7441

direction1=Reverse

Tm1=68.98

score1=1.98

length=98 
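
The `length` field in each TaqMan set appears to be the amplicon length, that is, the distance from the first base of the 5' primer to the last base of the 3' primer. The Python sketch below checks that reading against the CT520 values. The class and method names are hypothetical, and the inclusive-coordinate convention is an assumption.

```python
from dataclasses import dataclass


@dataclass
class TaqManSet:
    """Coordinates of one primer/probe set (assumed 1-based, inclusive)."""
    five_start: int
    five_stop: int
    three_start: int
    three_stop: int
    probe_start: int
    probe_stop: int

    def amplicon_length(self) -> int:
        # Span from the first base of the 5' primer to the last base of the 3' primer.
        return self.three_stop - self.five_start + 1

    def probe_between_primers(self) -> bool:
        # The probe should sit strictly between the two primer footprints.
        return self.five_stop < self.probe_start and self.probe_stop < self.three_start


# Values taken from the CT520 set.
ct520 = TaqManSet(7375, 7397, 7450, 7472, 7424, 7441)
ct520_length = ct520.amplicon_length()
```

For CT520 the computed span agrees with the listed `length=98`, and the probe lies between the two primers.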
CT601

Nucleotide 

Genomic coordinates : 
Start: 1511 

Stop: 1196 (SEQ ID NO: 168)
Amino Acid 

MYLSHIRQTPLVEERRALTFKMYHHNNNNQHSFVNCQCRRTSSSINCSSCSRETFNSVKA 
IQYFNKTSRNNTAHHFKMPASKDRNYSSFEYAETAVAAHNISQW 
(SEQ ID NO: 169) 
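
For CT601 the difference between the genomic Start and Stop coordinates (1511 - 1196 = 315) equals three nucleotides per residue of the 104-residue protein plus one further codon. The Python sketch below makes that check explicit. It assumes the extra codon is the stop codon and that the coordinate difference counts coding nucleotides; neither is stated in the listing, and `expected_span` is a hypothetical helper.

```python
def expected_span(protein: str) -> int:
    """Coding span in nucleotides: one codon per residue plus one assumed stop codon."""
    return 3 * (len(protein) + 1)


# CT601 amino acid sequence (SEQ ID NO: 169), as listed.
ct601_protein = (
    "MYLSHIRQTPLVEERRALTFKMYHHNNNNQHSFVNCQCRRTSSSINCSSCSRETFNSVKA"
    "IQYFNKTSRNNTAHHFKMPASKDRNYSSFEYAETAVAAHNISQW"
)

# Start: 1511, Stop: 1196 (Start > Stop, so the span is taken as an absolute difference).
ct601_span = abs(1511 - 1196)
ct601_consistent = ct601_span == expected_span(ct601_protein)
```

The same arithmetic can be used to flag records where OCR has dropped or added residues, since the coordinate difference would then stop matching the sequence length.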

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

YT66_CAEEL (Q11082) PROBABLE G PROTEIN-COUPLED RECEPTOR B05 32 1.4
O00885 (O00885) MAP KINASE KINASE PROTEIN DDMEK1 30 4.2
CHD1_HUMAN (O14646) CHROMODOMAIN-HELICASE-DNA-BINDING PROTE 30 4.2
O97292 (O97292) PFC0965W PROTEIN 30 4.2
O96226 (O96226) SER/THR PROTEIN KINASE 30 5.5
O96563 (O96563) C-13 ANTIGEN (FRAGMENT) 30 5.5

Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 353 to 3 of CT601; this
corresponds to nucleotides 1184 to 1534 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=96
5'stop=116
3'start=142
3'stop=162

5'primer=CTTTGTGAATTGCCAGTGCAG (residues 96 to 116 of SEQ ID NO: 168)
Tm5=58.03

3'primer=AGTTTCACGGGAACAGCTTGA (residues 142 to 162 of SEQ ID NO: 168)
Tm3=58.43

probe1=TCTTCCTCCATCAACTGT (residues 124 to 141 of SEQ ID NO: 168)
probe1start=124
probe1Stop=141
direction1=Reverse
Tm1=68.98
score1=1.98
length=67
CT521

Nucleotide 

Genomic coordinates: 

Start: 272387 

Stop: 268691 (SEQ ID NO: 170) 
Amino Acid 

MSKSSSSTVKSASFFNSLMENAPSSKIELLEDGWTKKAAAADTDTPTAKPTGLSISLMDI 
SGSMGSVKSAVADSCSGIMATLNVIAPGIQNAIVYYNDFDKHSIESGPWRAPDCSEWEG 
GDFVKHMRKTEVCGGGGGGSEALHSSLMYVFNNMIPAFKKMHGITRDEKFPILIFVFTDE
DVRIANSDTGKLCANSYDSETAPEEEFIMKTWGQKPLTILDMRKALVENDCWLRILNFSR 
CSGSNQSELCQEDVINFSGYDNNRWQLFESFDRRSCNVRKNIATFIMRQSISLFKNLNDQ 
FSAFPILREINQEELNVFIESEGRSEPAGFEKYGDAQRESFKSRVLNMAPLDFGRVVQGG 
GRYNNHKRSVFLNCAYDSAFCCSKQTFNPQQQQQQQQSSSGGGGISKLAVVTQRAQSITG
GGN7\ASTLALHMNACFQSLDDFGIDHTNLCDCKGCTKLMASVEATSDQGRKTKLSRKYAR 
VHWAKMFAEKLFKMMIKEQSMMYACSAVPDEIGAIYAFVTGNNAGVCSRVSTILSDLGTE 
CGNKAEYAFLKEGKHMKSASYDALQVINNTDLTPEQSSMFMWFYVPNDALEEAGKIFHQS 
FSFSNSYTGGGLLSLDEYKRFEFGQCFDFIKKLVSCLKITRNVEDVLLETSKTSNRYFAI
PVFCGSDDQKEVLREELASDLFGGREDVAEMMFIDLETVIQKLGTLYDVRLSLPEGGYAA 
IKSVCAAASWAASCEVPSNTSNMILSIAKMAFTKYYQEQNSSSETDLDIILPSILEGTAD
GEIENNLSGVVFLRCLITWANKIGVDKNFTNKLEHFLALRILTKAGDSKIGEKYETFPVR 
RLDLSEKDLKYICKRCGVKSLKMEYDNDEKLCLRCKGNYRMGKPMVYHWDNKLTRDPRAK 
TASPTTLNLLNAKKIDDKVKEMASDIIGALNLPPTDKDNEIAVSAAAKAVGILYGKTCLL
YKLLNEGNIDIPVAVCVECDCCKSKYMMSTLGPDKPQNRKCPWCRYANKLVAMGRGGKKL 
LMDLIECGAPSLAMVEEAIRTSGDVMYEELGEGEEFYIIDYFLKLKNTAIAEGNKLQQNN 
NKRPAPLQVTSPSSPPKKMRSDLPDSLLT^AIGECAIETKEKTTVNLIGLGEVKVVENVGP 
NDLDGKDPFISLQEYCSWDKFNSLFVNPWLGYRLDEQWDDWNTFLIHVKKNDVWKFLCNK 
TSPFSVVVMNDGSGLLNVDNVNVLVRQKICV
(SEQ ID NO: 171) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

AAF04634 (AAF04634) HYPOTHETICAL 45.2 KD PROTEIN 76 1e-12

Q9Y1T1 (Q9Y1T1) DNA POLYMERASE ALPHA CATALYTIC SUBUNIT (EC 36 1.3 

P91805 (P91805) ARYLPHORIN GENE -SPECIFIC BINDING PROTEIN-2 34 6.4 

O45322 (O45322) DY3.5 PROTEIN 34 6.4

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 895 of CT521; this
corresponds to nucleotides 268736 to 269630 of the genomic reference sequence. 
CT602

Nucleotide 

Genomic coordinates: 

Start: 2996 

Stop: 2702 (SEQ ID NO: 172) 



Amino Acid 

MDILEDIYKSAITLVLQSPEFVNDVKQEASQVVEGLIPSIREAVFRRLLEEERKKHEDEV 
GDVEDKRQAVIDKANTMITTMAAEYLESVDILEEFGF 
(SEQ ID NO: 173) 

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

CRCM_HUMAN (P23508) COLORECTAL MUTANT CANCER PROTEIN (MCC P 36 0.055
ACVS_PENCH (P19787) DELTA-(L-ALPHA-AMINOADIPYL)-L-CYSTEINYL 33 0.48
ACVT_PENCH (P26046) DELTA-(L-ALPHA-AMINOADIPYL)-L-CYSTEINYL 33 0.48
Q9XJC5 (Q9XJC5) HYPOTHETICAL 26.4 KD PROTEIN 32 0.62
NOSZ_ACHCY (P94127) NITROUS-OXIDE REDUCTASE PRECURSOR (EC 1 32 0.82
EZRI_BOVIN (P31976) EZRIN (P81) (CYTOVILLIN) (VILLIN-2) 32 1.1

Comments:

TaqMan Primer/Probe Sets:

5'start=121
5'stop=144
3'start=200
3'stop=222

5'primer=AGAGAAGCTGTCTTTAGACGGCTT (residues 121 to 144 of SEQ ID NO: 172)
Tm5=58.14

3'primer=TGCCTTGTCTATCACTGCTTGTC (residues 200 to 222 of SEQ ID NO: 172)
Tm3=58.14

probe1=AACACGAAGACGAGGTGG (residues 164 to 181 of SEQ ID NO: 172)
probe1start=164
probe1Stop=181
direction1=Forward
Tm1=69.14
score1=1.85

length=102 
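
Each primer and probe is listed with a residue interval, so the oligo length can be cross-checked against that interval, which is a quick way to catch OCR damage in either the sequence or the coordinates. The Python sketch below does this for the CT602 set and also computes GC content. Both helper names are hypothetical, and the inclusive-interval convention is an assumption.

```python
def matches_interval(oligo: str, start: int, stop: int) -> bool:
    """True when the oligo length equals the inclusive interval length."""
    return len(oligo) == stop - start + 1


def gc_fraction(oligo: str) -> float:
    """Fraction of G and C bases in the oligo."""
    oligo = oligo.upper()
    return (oligo.count("G") + oligo.count("C")) / len(oligo)


# CT602 primer/probe set, as listed.
five_primer = "AGAGAAGCTGTCTTTAGACGGCTT"   # residues 121 to 144
three_primer = "TGCCTTGTCTATCACTGCTTGTC"   # residues 200 to 222
probe = "AACACGAAGACGAGGTGG"               # residues 164 to 181

all_lengths_ok = (
    matches_interval(five_primer, 121, 144)
    and matches_interval(three_primer, 200, 222)
    and matches_interval(probe, 164, 181)
)
probe_gc = gc_fraction(probe)
```

The listed Tm values are not reproduced here, because the listing does not say which melting-temperature model produced them.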
CT522

Nucleotide 

Genomic coordinates : 

Start: 276736 

Stop: 275206 (SEQ ID NO: 174) 
Amino Acid 

MASGFAIKGIVKNYRRIPSIIESIKSIRRSELAEGVYIVSLHKNTPKHEVDEIVNKIRLS 

AGNPCLEKTSLFLQHHSQMRNFYTRKGAESESDWLKRLPEDLRNINNIVKREALPHDKSF 

TFSPLYRILTDRLFNAAIHNCKYIIVTADLLMGCGITNNKVEKKLLSMGSILGGESMVPL

HDIAHRLSYKGLRIENPIVGSCHDQCLVVPVSMLGKIFSSNMYPTFKNFDQCMALFLNAV

VTHSAEKMDGKHERNKVIHMPNEVYLDAARRKYLEEKLEETNKLDAIDEEAREEYGNEIG 

RIGDKSTCLVFALSARDFFLTNRFNEDTPLYSGTERGIRFMCSNYCTMRDEGGFRPRLIM 

SAYGPTSYPIIFNTLYDQFNVQYYPCVSGVVLSFIGDDQLAPEPESLVDIWRSIKNPSI 

RIFSGDGETVYQDGRRVDVGGEGKNQKFNREERTILNVLRIIKAYNEERTKEDEDEEEEE 

EEEEEQQTAATVTVESDWDLSLERGENWV 

(SEQ ID NO: 175) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

O61310 (O61310) TSJ5 44 0.003

CINA_ELEEL (P02719) SODIUM CHANNEL PROTEIN (NA+ CHANNEL) 42 0.009 

MYC_BRARE (P52160) MYC PROTEIN (C-MYC) 41 0.019 

Q9Y0C6 (Q9Y0C6) J5 PROTEIN (FRAGMENT) 41 0.019 

O35788 (O35788) CYCLIC NUCLEOTIDE-GATED CHANNEL BETA SUBUNI 41 0.025

MYC_CARAU (P49709) MYC PROTEIN (C-MYC) 40 0.033 
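
Each hit line in these tables carries an entry name, an accession in parentheses, a (possibly truncated) description, a bit score, and an E value. The Python sketch below splits one such line into those fields. The `BlastHit` type and `parse_hit` function are hypothetical names, and the sketch assumes a clean line in which the score and E value are the last two whitespace-separated tokens.

```python
import re
from typing import NamedTuple


class BlastHit(NamedTuple):
    entry: str
    accession: str
    description: str
    bits: int
    evalue: float


def parse_hit(line: str) -> BlastHit:
    """Split 'ENTRY (ACCESSION) DESCRIPTION BITS EVALUE' into typed fields."""
    # The score and E value are the final two tokens; the rest is the header.
    head, bits, evalue = line.rsplit(None, 2)
    match = re.match(r"(\S+) \((\w+)\) (.*)", head)
    if match is None:
        raise ValueError(f"unrecognized hit line: {line!r}")
    entry, accession, description = match.groups()
    return BlastHit(entry, accession, description, int(bits), float(evalue))


hit = parse_hit("MYC_CARAU (P49709) MYC PROTEIN (C-MYC) 40 0.033")
```

Splitting from the right keeps parentheses and digits inside the description (for example `(C-MYC)`) from being mistaken for the score columns.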



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 374 to 1 of CT522; this
corresponds to nucleotides 275340 to 275713 of the genomic reference sequence. 

Hit to public SBV sequence: 

gi|6007410|gb|AF178573.2: CT nucleotides 1 to 552 match nucleotides 555 to 6 of the
public sequence with a 99% homology, a score of 1063 and an Evalue of 0.0

TaqMan Primer/Probe Sets:

5'start=682
5'stop=703
3'start=767
3'stop=787

5'primer=AATTTTGATCAATGCATGGCAT (residues 682 to 703 of SEQ ID NO: 174)
Tm5=57.93

3'primer=CGTTTGGCATATGGATGACCT (residues 767 to 787 of SEQ ID NO: 174)
Tm3=58.39

probe1=CAGTTGTTACACATTCGG (residues 716 to 733 of SEQ ID NO: 174)

probe1start=716

probe1Stop=733

direction1=Forward

Tm1=68.99

score1=1.99

length=106 
CT603

Nucleotide 

Genomic coordinates: 

Start: 24906 

Stop: 24660 (SEQ ID NO: 176) 
Amino Acid 

MTCPEISKHISGTDRRFWNTADPGGLSYPFNPLFTLHLHLKNFSKIFSAHSSLGGGPLTR 

PYVKFEGWTAGSTQRQITERS 

(SEQ ID NO: 177) 

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

CAB49803 (CAB49803) HYPOTHETICAL 69.0 KD PROTEIN 31 1.8
O74312 (O74312) PUTATIVE TRANSMEMBRANE 29 5.2

Comments:

Hit to public SBV sequence:

gi|6165655|gb|AF099142.1: CT nucleotides 118 to 231 match nucleotides 5652 to 5539 of
the public sequence with a 90% homology, a score of 139 and an Evalue of 5e-36
CT523

Nucleotide 
Genomic coordinates: 
Start: 281865 

Stop: 281127 (SEQ ID NO: 178) 
Amino Acid 

MVSTRSMEAKAAAAAKAKEVSPTTSKRKAEDLTEGTEEEEESVETHPPSKLPRVDEDEVY 
IDENVDGDVQILASSIEVARMERERLAEAMVRDIKIEEEKAATEARKEIASRLIYKEMVY
LLPQLENMTNRLRPRSLLRHNEMTITDRTFSDLQIFNKVTFEFPILTDIAFLAREKSRVE 
GSRFYNDMKIGPITAYKLNLMCNKFIESWQKVKAEISPFVEVSVSSELEGSPFWDFKQR 
IVKHT 

(SEQ ID NO: 179) 
Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

Q23915 (Q23915) PROTEIN KINASE 43 0.001
O77819 (O77819) CORNEAL EPITHELIAL RHO-ASSOCIATED-SER/THR K 43 0.002
Q13464 (Q13464) RHO-ASSOCIATED, COILED-COIL CONTAINING PROT 43 0.002
P70336 (P70336) RHO-ASSOCIATED COILED-COIL FORMING KINASE 2 40 0.017
P70335 (P70335) RHO-ASSOCIATED COILED-COIL FORMING KINASE 1 39 0.029
Q63644 (Q63644) RHO-ASSOCIATED KINASE BETA 39 0.029

Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 1 to 712 of CT523; this
corresponds to nucleotides 281143 to 281854 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=404
5'stop=425
3'start=502
3'stop=522

5'primer=GATCACTTCTCAGGCACAACGA (residues 404 to 425 of SEQ ID NO: 178)
Tm5=58.93

3'primer=ACGGGCAAGGAAAGCAATATC (residues 502 to 522 of SEQ ID NO: 178)
Tm3=59.06

probe1=AATGACCATTACAGACCG (residues 426 to 443 of SEQ ID NO: 178)
probe1start=426
probe1Stop=443
direction1=Reverse
Tm1=68.98
score1=1.98
length=119
CT604

Nucleotide 

Genomic coordinates: 

Start: 50300 

Stop: 49079 (SEQ ID NO: 180)



Amino Acid 

MSHINSTSAATTSSNTLPICTTTAPMIAAARAAAIASRTSASAVTSINSNSTSSSAMFRV 
PQGISVTAMPPVPALTSLTESTGTRMSSTPNVDVIPVPGPKNKSKSKKKDSKRKKNQNGN 
RSSDEDEPSLVIDDGSGRQSKNKKYSWVTSLATTTAERNNDTLAPPRPFLPTPEEGNMSE 
IDAGLSNPVTRQITGEVYSAALTSGVGDNGLYPSHFTVADTSYGDCETPIPGPAFVLDDG 
TVSRGTSLLHREEAEFLNDGSKVIHTVKPRNSKYSNIQRAASCMAYAVDLLNNHNITSDQ 
FDFMAMTAWAARQRCGEMAKFFEKRDKDIGEYRNKVVQYNRGIFTRTTEMNKRAKIILEQ 
QQRREAAAAAAATGATAPIPTTSAAGVGATSSATTNSLEYQEIRYQ
(SEQ ID NO: 181) 

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

AMYH_YEAST (P08640) GLUCOAMYLASE S1/S2 PRECURSOR (EC 3.2.1. 55 8e-07
AAC49609 (AAC49609) GLUCOAMYLASE 55 8e-07
O39781 (O39781) MEMBRANE GLYCOPROTEIN 53 4e-06
O39782 (O39782) MEMBRANE GLYCOPROTEIN 50 3e-05
Q9Y075 (Q9Y075) PROTEOPHOSPHOGLYCAN (FRAGMENT) 48 8e-05
Q14888 (Q14888) MUCIN (FRAGMENT) 48 8e-05

Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 1 to 666 of CT604; this
corresponds to nucleotides 49128 to 49792 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=572
5'stop=592
3'start=640
3'stop=658

5'primer=GCCAAATCACCGGAGAAGTTT (residues 572 to 592 of SEQ ID NO: 180)
Tm5=59.06

3'primer=CAGCAACCGTGAAGTGGGA (residues 640 to 658 of SEQ ID NO: 180)
Tm3=59.24

probe1=CTCACTTCTGGAGTTGGA (residues 604 to 621 of SEQ ID NO: 180)
probe1start=604
probe1Stop=621
direction1=Forward
Tm1=69.03
score1=1.96
length=87



FIGURE 3, Sheet 98 of 160 




WO 01/38351 



PCT/US00/28888 



140/201 



CT524 

Nucleotide 

Genomic coordinates : 

Start: 291298 

Stop: 289681 (SEQ ID NO: 182) 
Amino Acid 

MEDFKQLKVKNGICLSGENTENYERVLLTFKSVKSVRRSELKEGHFIVRLRDKEVLHIKN 

GNERLRQLTGDPTLQIGLKYTSSLPKQGSFLEDEDPNYGKKWNESLPSPFQEMNKIVEEK 

ALVNDKNFKFSPLYRIIHERLSNAAVKKCDYMIITTDFLVGCGFSPRNCTRTLKNMEQVL

VQHGGTSSRVSVYDICDRLTYNGLSIANPIVGSFSNMCLIVPMDKLGLLFYNSTHPSAKS 

IGNYMSCLFNAAWYTLEKSNQKLDNFEKEIRFAKNEVNLLVSERSVLEEKLKESKKLYA 

ASEEQRISLRDVHKKSSIASSRYDGGACLVFAFSDRDFSLLCRTNGNGSFYSATEEGIRY 

VSSDDYRKRDVDERRPRLVMSITGSDAPICIRDSIRNHFNNHFIASGKGNEISFIDPPNE 

RLLMEMVREVTGSDIKIFMDNGKVYQDGVEIKVIDPSSKEGKDIIKKEETLPEEERKRLR

RERRMIFNTVKAIETYNEERGEEEEVATSSGGTKRKREEKEGDYVALLNKACKEIKVC 

(SEQ ID NO: 183) 

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

Q57105 (Q57105) FESMID CLONE 31, COMPLETE SEQUENCE 39 0.10
O94986 (O94986) KIAA0912 PROTEIN (FRAGMENT) 39 0.10
O94944 (O94944) KIAA0866 PROTEIN 38 0.23
MYST_HUMAN (P35749) MYOSIN HEAVY CHAIN, SMOOTH MUSCLE ISOFO 38 0.23
YDH6_SCHPO (Q92351) HYPOTHETICAL 140.8 KD PROTEIN C6G9.06C 38 0.23
Q22705 (Q22705) T23G7.3 PROTEIN 38 0.23

Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 1642 to 1 of CT524; this
corresponds to nucleotides 289650 to 291291 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=723
5'stop=746
3'start=828
3'stop=852

5'primer=TGGAAATTACATGTCATGCCTTTT (residues 723 to 746 of SEQ ID NO: 182)
Tm5=58.47

3'primer=TTCGCTAACTAGAAGGTTGACTTCA (residues 828 to 852 of SEQ ID NO: 182)
Tm3=57.59

probe1=TGCTGCAGTTGTATACAC (residues 750 to 767 of SEQ ID NO: 182)
probe1start=750
probe1Stop=767
direction1=Forward
Tm1=68.96
score1=1.96
length=130
CT605

Nucleotide 

Genomic coordinates: 

Start: 51809 

Stop: 50423 (SEQ ID NO: 184)
Amino Acid 

MDSSASVVFMRFAPPGEETALPPRRATPGSVAYDLFPSEEMDIEPMGLAKISTGYGIDKF 

PDGCYGQIVSRSGMTWKNNTSVPTGTIDVDYRGELKVILRNHSAEKSVPIRKGTSIAQLI

FLRYCDVEEEQIVYINETTGERTIIDSSSKKDNKNQARSVRGTGGFGSTDNPNFTETTVS 

RNQQEENKKEELEEGEIVEMEGFIDIPFLEGFENILAEQSNETGVTYPNTNQDVEEKDTK 

NIDVVRELEAEFSSGIGSGSMDSSDSSDSSSSSSDSSDSSDSSDSESSDDSEGGDNKVRR 

IRRHQYHRRQLSYSDDVNGGGRNSEKMEMDRVTHIKTEHIKREDEPRYEERERYIHPRRM 

QVPKDYYCEQYEHYDAPAAAHHHRHHQHRHQHQRHFNQPRSNNSSDVTAYVNENSPTRPC 

RDRNSRFSERPNNGGYNRINSRYTTFDPYRYGARRGRGGVY 

(SEQ ID NO: 185) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q89662 (Q89662) COMPLETE GENOME 90 2e-17 

Q86612 (Q86612) ORF2 89 8e-17

DUT_CHVP1 (O41033) DEOXYURIDINE 5'-TRIPHOSPHATE NUCLEOTIDOH 88 1e-16

DUT_YEAST (P33317) DEOXYURIDINE 5'-TRIPHOSPHATE NUCLEOTIDOH 87 2e-16

DUT_LYCES (P32518) DEOXYURIDINE 5'-TRIPHOSPHATE NUCLEOTIDOH 83 3e-15

DUT_HUMAN (P33316) DEOXYURIDINE 5'-TRIPHOSPHATE NUCLEOTIDOH 83 5e-15

Comments : 
CT606

Nucleotide 

Genomic coordinates: 
Start: 65024 

Stop: 64010 (SEQ ID NO: 186) 
Amino Acid 

MSSSQGLNNNMCTTEILLPKCTSSSLSLEESVDYLEKDFEELGIPLVEGKEVLLEFAYKI 
LNKRDTIRVIGDEQGDVCSVFFLRFGKKKTFNPQTKMWLVKLANAIALSMGVVPEPACTC 
SRMMTTAKKIPVPESYKNVNRNIQKFEDVHYIDINFQSFVREQIGLSVLGKNDVQKKKKE 
ETPFFAPFNKSKIGGECIEDLKYDSESVSIIRDVFNLLGEMPTEDVKTSRSCINPSHNDT 
NPSMRLVFRPMYWRNSKLVMDKLSKEQDSALIEKYMGGEHQHCIIGGRNVLLYCITALCF
SSDCGFKKMLTNDEIKQLIWYLVLLFFHIICPIIQSK
(SEQ ID NO: 187)

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

YNHE_ECOLI (P77522) HYPOTHETICAL 56.3 KD PROTEIN IN LPP-ARO 35 0.69
TTK_MOUSE (P35761) DUAL SPECIFICITY PROTEIN KINASE TTK (EC 34 1.2
Q56333 (Q56333) FLIL 34 1.2
CAB49508 (CAB49508) HYPOTHETICAL 32.8 KD PROTEIN 33 3.5
BAA85006 (BAA85006) ORF1P 32 6.0
BAA85071 (BAA85071) ORF1S 32 6.0

Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 531 to 1 of CT606; this
corresponds to nucleotides 64063 to 64593 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=546
5'stop=567
3'start=655
3'stop=677

5'primer=CCCTTTCTTTGCACCCTTTAAT (residues 546 to 567 of SEQ ID NO: 186)
Tm5=57.43

3'primer=ACATCCTCAGTAGGCATTTCACC (residues 655 to 677 of SEQ ID NO: 186)
Tm3=58.14

probe1=GTATGATTCTGAGTCTGT (residues 606 to 623 of SEQ ID NO: 186)
probe1start=606
probe1Stop=623
direction1=Reverse
Tm1=68.79
score1=1.79
length=132
CT607

Nucleotide 

Genomic coordinates: 

Start: 68659 

Stop: 65032 (SEQ ID NO: 188) 



Amino Acid 

MGVPEAKKVYENAYGAQNGRVIKEKTGYEDCYDDEDDEDYCSGEEDCTTSSLLKATSLAN 

INSKNFLDFGRGKKSSSSSPTCDYTLDMVDLPTYNVSDLVMLGRQIATTMLKGQKNMGQM 

ILFINTTNQQIIDVLHDGFNVIREEDTMHSRMQNKKHIYENFYCRDEKKVISEFFSRKYK

HEKIKARIERVPIIIPSSQEEVDWLTEPPIEDMMMAPPVSNHKMDDYEGLDYWINKHTDV

MKKRKFLTNSFLFRNVPTTSFNSSPTAVLKSRFKDAFFASQMEGVILYYAFRMIRVMKNL 

LKSKNLKGRYTVLFTDGKAPAIKMMTRAKRQIRQERSKEKAKSRNENCLNRKTNDLLFYS 

CERMMMRLPQGLMASALLDIMRIPVLKTTGSKCMYLSNASFTEAEDDIVRLTSCLLNLET 

PGKHFSLLEKRKILEYDSYNMSGNRKESKRWEDLLNVLKQHTNDENQTLSMNLFSHDSDV 

LVKWNLMVGHHKNVCRLTGTQFKDSETFLKIGHVKFFRCMNSNSSGENQANELGGFAAKR 

RTKPNTIYNLAESPLMLSPESTLLIMLTKGSDYNSAIVSNCEYDTWVRKEVAVFENTYCT 

CVGGWEIFLSEQEARKNNKDCDDSVGNISMGNLSKSNCRKCDKKLVLPFWTIKFFYLSQA 

IDFVRDPLQLCFPPTHLIDLETDVSLKHALHRALAVNAAANVMSYLTMGSFNQRVFGTIT 

TLSDISIHLSGANNNESKNTGSDVESDTEDLIPFSNNKRKSGNDPQKSTRKKSKVNATRK

SAPVTKKLSSSVFESIRGFFESHTEGGIINDRGILTKERIDVFGNNLDTNPEALGEENGG 

GGGIVSSIPGLSTEQTSILKTEQNNSTSDFLDFFKKFNEMDDVEEEEEKMEEGEKEEEEA 

DLETDDWLDEARKAFEYKDSDFLEAVTAATNEMTSSLAKNNIEEDEHSRCSVSSKLNNKQ 

PVMDEEKWAEIVNEFDKCISLDNITYNDNSLLSRLSGVLMDANKREDGNNSNVVLYEPVQ 

GIDDERFSGVPYSVKTMNLLVIVYMNMCGLEDNTIVYQQLMPIIHSEFCGKTEEDKICTD 

RTNFMSAALEYTMLQYMPELKKTPRIKQIKRKNWERIPKVLDDFKDKVSTCTDNYNKLLA 

TLNKEGKIPSENTKWLPSQGQFMPVLGVAISKPWSPLTLWSSFYLQHQQRQDVSLTNITP 

PNSPRPEQ 

(SEQ ID NO: 189) 

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

Q12532 (Q12532) HYPOTHETICAL 119.1 KD PROTEIN YPL009C 50 1e-04
Q9XZI5 (Q9XZI5) RANGAP 47 8e-04
Q13387 (Q13387) HYPOTHETICAL PROTEIN 384D8_2 46 0.001
Q9YTL7 (Q9YTL7) ORF 48 45 0.003
Q25662 (Q25662) REPEAT ORGANELLAR PROTEIN 45 0.003
O96266 (O96266) HYPOTHETICAL 283.6 KD PROTEIN 45 0.004

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 860 of CT607; this
corresponds to nucleotides 65102 to 65960 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=1428
5'stop=1448
3'start=1516
3'stop=1538

5'primer=CGATTCTGATGTGCTGGTGAA (residues 1428 to 1448 of SEQ ID NO: 188)
Tm5=58.31

3'primer=TGGCCAATCTTTAAGAACGTTTC (residues 1516 to 1538 of SEQ ID NO: 188)
Tm3=58.35

probe1=TGGAACCTGATGGTTGGA (residues 1450 to 1467 of SEQ ID NO: 188)
probe1start=1450
probe1Stop=1467
direction1=Forward
Tm1=69.04
score1=1.95
length=111
CT608

Nucleotide 
Genomic coordinates: 
Start: 69118 

Stop: 68704 (SEQ ID NO: 190) 
Amino Acid 

MDGDSSSLQLLSESEFDYVVETLKEQGVWELALEVFNEVSNSIETVKEEEDYTVLRSRNY 
FPTESITLYKQQQEEEESTPIKKRKLASGKSPRSLCRELRLLQIPSTTTFKAAPRSSSRR
GKNTRLRRVCKNYGAHQ 
(SEQ ID NO: 191) 

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

VE2_HPV08 (P06422) REGULATORY PROTEIN E2 38 0.028
FLP_ZYGBI (P13784) RECOMBINASE FLP PROTEIN 34 0.41
Q9XZS0 (Q9XZS0) GUANYLYL CYCLASE (EC 4.6.1.2) 33 0.71
CINA_ELEEL (P02719) SODIUM CHANNEL PROTEIN (NA+ CHANNEL) 32 1.6
GLMS_HELPY (O26060) GLUCOSAMINE--FRUCTOSE-6-PHOSPHATE AMINO 32 1.6
Q9Z8C4 (Q9Z8C4) TRANSGLYCOLASE/TRANSPEPTIDASE 32 1.6

Comments:

TaqMan Primer/Probe Sets:
5'start=114
5'stop=138
3'start=235
3'stop=259

5'primer=GGTGTCCAATTCTATTGAGACTGTG (residues 114 to 138 of SEQ ID NO: 190)
Tm5=57.77

3'primer=CGAGTTTCCTCTTCTTAATAGGGGT (residues 235 to 259 of SEQ ID NO: 190)
Tm3=58.39

probe1=AACAACAACAGGAAGAGG (residues 209 to 226 of SEQ ID NO: 190)
probe1start=209
probe1Stop=226
direction1=Forward
Tm1=68.86
score1=1.86
length=146


FIGURE 3, Sheet 103 of 160 






CT609 

Nucleotide 

Genomic coordinates: 

Start: 78365 

Stop: 77441 (SEQ ID NO: 192) 
Amino Acid 

MWCSTHLSYSEFFTPLQSQKLFRNFFRALEFRGWTASSTECQVPRVDLWVGPMSDYTRNC 
WFQKRTLTFVCFWNRRFWRLVDPEMRGYNLLFSLENFTLPLSQKLFKNFFRALQFRGWTA
SSTECQVPRVDRWVGPMSDYTRNVIAPETYINFCVFLEQAFLETGRPRNERVYPSVFTRE 
FYSSSISKTFQKFFRALQFRGWTASSTECQVPRVDLWVGPMSDYTRNVIAPEIEEVSYGH 
FWTRCFWTKILLDGNPLPLPPPFKKGPRVYNDCTTPHSNHHNHHHHHGRTSILQQTLSRK 
WSSLL 

(SEQ ID NO: 193) 
Top Blast Hits 



Sequences producing significant alignments: 



Score E 
(bits) Value 



DYRK_RAT (Q63470) DUAL-SPECIFICITY TYROSINE-(Y)-PHOSPHORYLA
DYRK_MOUSE (Q61214) DUAL-SPECIFICITY TYROSINE-(Y)-PHOSPHORY
O61543 (O61543) BHLH-PAS TRANSCRIPTION FACTOR SPINELESS



37 
37 
37 
37 
36 
36 



0.22 
0.22 
0.22 
0.22 
0.29 
0.50 



DYRK_HUMAN (Q13627) DUAL-SPECIFICITY TYROSINE-(Y)-PHOSPHORY
HXA1_MOUSE (P09022) HOMEOBOX PROTEIN HOX-A1 (HOX-1.6) (HOME
O60275 (O60275) KIAA0522 PROTEIN (FRAGMENT)



Comments : 
CT1010

Nucleotide 

Genomic coordinates: 

Start: 41348 

Stop: 41795 (SEQ ID NO: 194)
Amino Acid 

MTHLVLLILSLSLSPVYHHLTPYLSPHLTYTPISPITSIFPHLIHSLQFQHPVLAEPTHN 
QIWTPVFPFIPNRHHLCPQALAVYIRRRGQARSISSLQASRRATQQALSLLLPRRDLPIL
KLQEWPLQPPPHQVLTPCWTLSYLVLRN
(SEQ ID NO: 195) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

O62081 (O62081) C31A11.6 PROTEIN 36 0.20

Q05330 (Q05330) PUTATIVE ORF 32 2.2

Q60974 (Q60974) RETINOID X RECEPTOR INTERACTING PROTEIN 13 32 2.9

P79230 (P79230) KAPPA CASEIN (FRAGMENT) 32 2.9

Q9WAL8 (Q9WAL8) POLYPROTEIN 31 3.8

AAD12852 (AAD12852) Y8A9A.2 PROTEIN 31 6.6 

Comments : 

TaqMan Primer/Probe Sets:

5'start=209
5'stop=227
3'start=280
3'stop=300

5'primer=TCCCTAACCGGCACCATTT (residues 209 to 227 of SEQ ID NO: 194)
Tm5=58.83

3'primer=GCTGGCTTGCAGAGAACTGAT (residues 280 to 300 of SEQ ID NO: 194)
Tm3=58.29

probe1=TATATAAGGCGGCGCGGC (residues 250 to 267 of SEQ ID NO: 194)

probe1start=250

probe1Stop=267

direction1=Forward

Tm1=69.00

score1=1.99

length=92 
CT1011
Nucleotide 
Genomic coordinates: 
Start: 41758 

Stop: 42097 (SEQ ID NO: 196) 
Amino Acid 

MLDSKLLSSEELKELTSYVSTSSRRSDMKKHLLHLFEEHEKIFQFIQGKHKFSLYTLDFE 
IFYVMLNILLVEVKNILSPIPLLFDRNLQPVRRLWMFHNGPASPERCSRSLG 
(SEQ ID NO: 197) 

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

O44068 (O44068) DNA POLYMERASE ALPHA (FRAGMENT) 31 2.3
Q49563 (Q49563) DNA, TRANSPOSON-LIKE ELEMENT ENCODING 3 ORF 31 3.0
BAA74885 (BAA74885) KIAA0862 PROTEIN 31 3.9
O44055 (O44055) DNA POLYMERASE ALPHA (FRAGMENT) 31 3.9
O76063 (O76063) LEUCINE-RICH REPEAT PROTEIN SHOC-2 31 3.9
YD89_METJA (Q58784) HYPOTHETICAL PROTEIN MJ1389 31 3.9

Comments:

TaqMan Primer/Probe Sets:
5'start=107
5'stop=127
3'start=193
3'stop=217

5'primer=TCGAGGAGCACGAGAAGATCT (residues 107 to 127 of SEQ ID NO: 196)
Tm5=57.83

3'primer=CTTCAACCAACAAAATATTCAGCAT (residues 193 to 217 of SEQ ID NO: 196)
Tm3=57.46

probe1=ACAAGGTAAGCACAAGTT (residues 138 to 155 of SEQ ID NO: 196)
probe1start=138
probe1Stop=155
direction1=Forward
Tm1=69.01
score1=1.88
length=111
CT1012
Nucleotide 
Genomic coordinates: 
Start: 42053 

Stop: 45491 (SEQ ID NO: 198) 
Amino Acid 

MAPPHLNAAADLLDKVMSGPLSPEGAQTSSPAACVGAKVVKALVSFCQKTRFTTNIVMRE 
VKAMEFQGDDFNYSALCASMPQRPVTERQMFALMKSEDEEMGVSANFSPVSDDVINPSSL 
PSGQEVDSSTSAQISGMFQNVWSLLEECGSGSNSNSSPVSRTVLVCTLFIIQVFKFLVTK 
VSNVNVLNQLFGHVVFGSLDVAPSNNNSVPSTVVNNNNKPSTSNNSNNISNKRVGGSNNS 
GGGRSKKVTATAKNPFNNVDGDNHGMFAGAPVDVNLDDFVFPQVETLTSKSTIPKEEVNV 
DEDLSKMCRKTALTPLEIHTFNVFISEINPSKYDRSMFCKGFLTAWDKFVEGDTAGVKRF 
RNYILTRSNyASAARAVYEASIKGTVYYMDKSKFLFHDNVNPDLDKSWG^3KNGKKPRLPA 
NLMAFMGIDIVKVCAKGIQKYMFAKQFQHPEVEELVPPMAVYAKVAAGLKSGTLFDDWDL 
PEYENCQFIKYDTEGCKKHSELYAKQLLRTGLNQYNKLEEGQSAFPFANIVTVTSASSDD 
IHGDTHELMYKTKDGVKGVSKIEDENIIKVNPAEEKKNNRVQAEKTLYFEIDSDDEVCE
RTEEEFFRPTSVVAAPTTPLVPSNVEEEEEEEEQMEEEEEEEVEREEGSDKEDDGDAPAQ 
EEMEEEKEEEQQQQPEEESNGNENQEEEQQQQQQPEREEENKDADSDSDSDSSSSSSSSS 
SSSSSSSSSSSSSSSSSENEAEKKKEEEVPAKIQKRKRLSERPSEAASSPKRMRVEEEQQ 
QQLSPSLDILQTAVDEMMEEIPAPEPIVATTSPKAATLALKTGFSYSSFVRGDDLSVAGN
TSPTEPAAVPAAATCTSDVGNDFLDMLDGLPGDIVMQPGECDVTAKFFEGITLPDGTDNE 
CTGFDDLLKATETDNIITTTCFTSPIHPSSNSAPRKDIDNCSSIKRSRAGSLFDTDDDSE 
TNEVEKEAPKRKKHLKKRRNKSHRGSSGSASSSHCMSSDEESEDERDMKSTSKVHKSPKA 
HVKHSPKYDAVNSDVNNSYNNVNSTTCMSSSDSDAEAQPKSHNKSHSRKHSSSSTSDKKQ 
NQQCSINTQNVKKTVVQSPPSFRSFSPKKDELGDFLSRKHTKPVRPYNKKRDNVNTTNNV
VQRSA 

(SEQ ID NO: 199) 
Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

Q9YTL7 (Q9YTL7) ORF 48 101 4e-20
O95815 (O95815) DENTIN PHOSPHORYN (FRAGMENT) 99 1e-19
GAR2_SCHPO (P41891) GAR2 PROTEIN 97 6e-19
AAD46501 (AAD46501) LATENT NUCLEAR ANTIGEN 94 6e-18
O40947 (O40947) ORF 73 92 1e-17
GARP_PLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN PRECURSOR 91 3e-17

Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 1 to 955 of CT1012; this
corresponds to nucleotides 44515 to 45469 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=1345
5'stop=1366
3'start=1389
3'stop=1407

5'primer=CATCCGGAAGTGGAAGAACTTG (residues 1345 to 1366 of SEQ ID NO: 198)
Tm5=59.57

3'primer=CAATCCTGCGGCAACCTTT (residues 1389 to 1407 of SEQ ID NO: 198)
Tm3=59.65

probe1=TGCCTCCTATGGCTGTAT (residues 1367 to 1384 of SEQ ID NO: 198)
probe1start=1367
probe1Stop=1384
direction1=Reverse
Tm1=68.99
score1=1.99
length=63
CT1013

Nucleotide 

Genomic coordinates: 

Start: 45950 

Stop: 47825 (SEQ ID NO: 200)



Amino Acid 

MSSTTSPSSLWDDDDDDDEEDEKDVKQEVSNRPPIFSYMETVSFSDNDEDDNKGEEECFG 
SNFDMFGDSDNMPSTSTAPFPPPSTTTPLPTPRSIMDTDSDECDEEGAAAASAPSIAASS 
SIPVGISEAELKKMEKKKRKEIKKLKKMMKDPLPHLYVGGEPPVAADYKTRANISLYKVD
PSIDMCGVAPPQFCAELPTPSIDVYTSSYVFPPPTPAMHNKKGSKKCQFLKGRKALRKWI 
HENVCMAPPGKRGGVFLAHLEQRFLAEHGDEYKVPRMFVSRVLNKAFPNLIARADTLCSD 
MTFYTNLCWIVNGWVCFDKDDGGIHGDASEYATGENFDTVVFHKREEQKTNGSASKKRR 
LTPDTSNMGTSTDVQEFQTMGTNTDMQEFQSMGTNTNPIETSSVGVNTNPLPNPPPRLVI 
TPLTNDVPELDMMWLYSPSRGGGNSRMSANTGTSPLSNTPIPTCFTGGANVWPNGFVPP
TFPLECDEDDPSIPNSYNYEEDKVFHPFYEYMAKYLSPLVPSYNKGQTCNVVQEWFKGSF 
SLAKRRGTVPKFCSNISHAFFCNMDVCTAMCKWAKTVIRHGQYCNRCIVRRSCTSMLAYH 
YIVCRDASCDVPKCRERVRNDMDD 
(SEQ ID NO: 201) 

Top Blast Hits

Score E
Sequences producing significant alignments: (bits) Value

AAC97971 (AAC97971) FAS-BINDING PROTEIN 47 4e-04
Q19871 (Q19871) F28D1.6 PROTEIN 44 0.004
NUCL_MESAU (P08199) NUCLEOLIN (PROTEIN C23) 43 0.006
AAD56625 (AAD56625) NUCLEOLIN-RELATED PROTEIN NRP 43 0.008
VP41_YEAST (P38959) VACUOLAR ASSEMBLY PROTEIN VPS41 (VACUOL 43 0.008
O35613 (O35613) FAS DEATH DOMAIN-ASSOCIATED PROTEIN (DAXX) 43 0.008

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 934 of CT1013; this
corresponds to nucleotides 46867 to 47800 of the genomic reference sequence.
CT1016

Nucleotide 

Genomic coordinates: 

Start: 52093 

Stop: 54913 (SEQ ID NO: 202)
Amino Acid 

MSEPSVYAFIDIKEIENGWEKEFGLLVQPGQKLAPFRDISYDSSKLDCDAFSCIPSDILH

SDNEKRVGECNFAEHTSVSFPVKNPEGKTLRHFTACGPGCYRRYKQRDPHTGLPVARGVL 

MQDHVDHETGNKMCEYLNQSLVMWAAVPWIRPGDLTEGYNTTHVPGFAFKEDDERDSKRV 

KYENVVISKAYCDFFKQYYDADSGSCYRSGWMKFVHLMFGQYFTNLSYNLANPKPYNLTG 

NTWSDVVSVLTDNPIVDAGAAPSRSEMDEIITKKKFNVFPSEQTSARQKAENIIRSQYGD 

GVEIDPSSVDALMQFVNREGVVGTEKKSDRLMRVADAVMDAAMRLQVMGLDDSQSRRLLL 

KNMIKMSRNNPEYARHFSSSLKLIGVTLAIKRSVFSKGASAKRKETAINNGEQHRRSRWS

PETVTEEDALLFARENITEDPKHPAPFVDILHSPDINSSIKSGSSSSIWNDILSRISSTR 

KLEEKASVFVKNLVVKVVRQFLDILEGKLFSDGYEWDDNIPLMIGVDQILREVIKAASNM 

CARFASSALESSLVTGFIDSASAITSRLAVQLAARTFSVFLEESVIEFVVAASLRLAIQA 

FADLATLAASALTVIGIVIFVIQVLGLILDLALGLGWYDHIFSPEDLKKQVLVFRREFAK 

AGNVDVGVAQPVTPEEIVAINVFLQTEENGEEKKEEGARKSKIDFLQKYFHSTPLMGKKS 

KFVYIQEAAQEYLGGRTMNAFGQRIITAADDSDTTTTTQEGRRDDETVTKKMRSIILETG

QTLKDYSSAVNYNASRLDYVGEEWVRNTALKEETRSNTTSDNLFKKTVSLASMAGAFLVL 

GIGVLVASHITLLRFTNIGLAFAFAGLLAFIALMSISYINMNAMGVVNSDAIYRSTALVG 

DIKTDPRRVGMVQRHVGVGAKYNMITDFVSPMLDEIESD

(SEQ ID NO: 203) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

O67140 (O67140) L-SERYL-TRNA (SER) SELENIUM TRANSFERASE 41 0.048

YD86_SCHPO (Q10411) HYPOTHETICAL 222.8 KD PROTEIN C1F3.06C 38 0.32

Q88606 (Q88606) (CRUCIFER) GENOMIC RNA FOR RNA-DEPENDENT RN 35 2.1

PME3_PHAVU (Q43111) PECTINESTERASE 3 PRECURSOR (EC 3.1.1.11 34 3.6

O81424 (O81424) T2H3.2 PROTEIN 34 4.7

O04260 (O04260) HYPOTHETICAL 49.5 KD PROTEIN 34 4.7

Comments : 
CT1017

Nucleotide 

Genomic coordinates: 

Start: 55054 

Stop: 58189 (SEQ ID NO: 204) 
Amino Acid 

MDSLTNTVTLLVNDRLGNHRTNKPITEQDVENTLNLNSLERASLLKLYSVFIKEMQSYSG 

CIPKNKYTNVQEIFEDGLITFEWRDGTKVHRSVSPSSPIPLSTKKSPRSSPSPPPSMPSI

KEEEFEEEFEDDEEIYETDENVEDFINGDGEDSEEEEEEDIIVDDEEEENEEGENKYVLA 

FSNHLRRQTAAAAAAAAAAAAADIEKKDKNHAVSAHDYTLSALQQQQQKLLQQQQQQQHQ 

QRSSSEKVTSTPNKFNKFLLPSNGFSEQTELFVCFDVDKIAQYNGLVELDILPIVAEYII 

NGLGLKCSMETPPVKPCRRKEVKDVWCQPKTSFENDAVEDKHLAFAESPILQRPRDFPIP 

KKITAYFCLDDSVDIKNPWGSCPLLKSGSNFRVSEYSRHFNEFSGVKNDDDTSSNTCFIY 

SQKNPNIEIVSKLNIEFEVMMEGIITHRKDLFETGILSDSSLATAMAFCHPKARVRNVAL 

FYFSVYLPFSKITRKETIKCSETDKVHIGSDAIFSPPSDNPNISAHQNNNNNNNNNTSVN 

IEDRPIRNNNISRKMTITNYQCMACKERCTNNCTNGNYPDRGNQHLSHSVKGEDFFKILN 

NSKVDSLKKLSRVLIPAPPSGNYTSKFCDRSSMCHSFFCRGIEPVSTSFSSDSFEKTKLV

LYGKVVDVINSYSAIKTSHNNRIRVFFNSEEKDNKTIPSRAESAKNAFKDILVHECNKER 

AVSYFEQNKLSSKDGHLSNKWWIELNDLNIMFEKHVEDFYKKCSKVNDAESLKDIFNDFE 

KTCDKYKTAKRAIIGAQDPSTSTPSKKENGITRIISTLSEFHSKDEATVSALLDKTMLLG 

SRTIMSGVRCVIRNNSVFSGFENKNTNNNWELEIRHYVISMGGAAVTKISDEDLEQFTPV 

RGAVSVTTAPNDKLPVGAHQTWKDEQTLKTNTKRNSLYDSYNSKRNNRDNNKIKNRSLKL 

SDFNWRTPNISIQEFNANKDDVNKKRYAEVVASAAPKSPSPTSSSSSNSNSSSPPLSPLS 

PTVKNSNNKPLYIPPHKRMTTTAV

(SEQ ID NO: 205) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



077033 (077033) TRFA 


60 


7e-08 


Q94463 (Q94463) K7 KINES IN-LIKE PROTEIN 


59 


2e-07 


NUCL_CHICK (P15771) NUCLEOLIN (PROTEIN C23) 


57 


5e-07 


AAD4 6501 (AAD4 6501) LATENT NUCLEAR ANTIGEN 


57 


8e-07 


040947 (040947) ORF 73 


56 


le-06 


Q98148 (Q98148) ORF73 H0MOLOG 


55 


2e-06 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 215 of CT1017: this 
corresponds to nucleotides 57667 to 57881 of the genomic reference sequence. 

TaqMan Primer/Probe Sets:

5'start=1741
5'stop=1762
3'start=1847
3'stop=1864

5'primer=CGTGGTAACCAGCACTTGTCAC (residues 1741 to 1762 of SEQ ID NO: 204)
Tm5=59.16

3'primer=TTCCAGAGGGAGGAGCGG (residues 1847 to 1864 of SEQ ID NO: 204)
Tm3=59.84

probe1=TGAGCAGAGTACTGATTC (residues 1829 to 1846 of SEQ ID NO: 204)
probe1start=1829
probe1Stop=1846
direction1=Forward
Tm1=69.01
score1=1.98
length=124
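
The fields of a primer/probe record can be cross-checked against one another. The sketch below assumes that `length` is the amplicon length, equal to 3'stop minus 5'start plus one, and that each primer spans exactly its listed residues; both readings are inferred from the numbers in the record and are not stated by the listing. The class and method names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class TaqManSet:
    five_start: int
    five_stop: int
    three_start: int
    three_stop: int
    five_primer: str
    three_primer: str
    length: int

    def amplicon_length(self) -> int:
        # Assumed definition: inclusive span from 5'start to 3'stop.
        return self.three_stop - self.five_start + 1

    def is_consistent(self) -> bool:
        # Primer strings must match their residue ranges, and the
        # recorded length must match the assumed amplicon span.
        return (
            len(self.five_primer) == self.five_stop - self.five_start + 1
            and len(self.three_primer) == self.three_stop - self.three_start + 1
            and self.amplicon_length() == self.length
        )


# Record for CT1017 (SEQ ID NO: 204) as listed above.
ct1017 = TaqManSet(1741, 1762, 1847, 1864,
                   "CGTGGTAACCAGCACTTGTCAC", "TTCCAGAGGGAGGAGCGG", 124)
print(ct1017.amplicon_length(), ct1017.is_consistent())
```

The same check is useful for spotting digits damaged during text extraction, since a single wrong digit in any coordinate breaks one of the three equalities.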

CT1018

Nucleotide 

Genomic coordinates: 

Start: 58947 

Stop: 60029 (SEQ ID NO: 206) 
Amino Acid 

MKICQISSPTLTLSI PLEGVYHVKQLLHLKVHLDVKGVKQLLHLKVRLDVRGAKQNPWRK 
NLCLLKKNVKSAKQLPHLKVHLDVKSAKQLPHLKVHLDVRGAKQLPHLKVRLDVKSAKQL 
PHLKVHLDVRGAKQLPHLKVRLDVRGAKQNPWRKNLCLLKKNVKSAKQLPHLKVHLDVKG 
VKQLLHLKVRLDVRGAKQLPHLKVHLDVRGAKQNPWRKNLCLLKKNVKSAKQLPHLKVLL 
DVRGAKQLPHLKVLLDVRGAKQLPHLKVLLDVRGAKQNPWRKNLCLLKKNVKSAKQLPHL 
KVLLDVRGAKQLPHLKVHLDVRGAKQQQQLCLPLKTISTSFTHLLLCLYMEYGKHQNLQV 
X 

(SEQ ID NO: 207) 
Top Blast Hits

                                                             Score     E
Sequences producing significant alignments:                  (bits)  Value

O41125 (O41125) A643R PROTEIN                                  36    0.34
O77336 (O77336) PFC0425W PROTEIN                               35    0.75
Q65683 (Q65683) 42K TRANSPORT PROTEIN                          32    6.5
Q48275 (Q48275) HYPOTHETICAL PROTEIN (FRAGMENT)                32    6.5
Q89659 (Q89659) 42K TRANSPORT PROTEIN                          32    6.5

Comments:

CT1019
Nucleotide 

Genomic coordinates: 
Start: 62203 

Stop: 63019 (SEQ ID NO: 208) 
Amino Acid 

MDVEFGFFHGLL5KALLPDEKHQPVIRRLCADDSRNKGEDGCCSFCGRRGTGESNTACLE 
QLIDVCSFIGTVSSIGTIINSNLSTSCSRLQKTSDSYAALSHSSFLDVVYPSLKKTTEDV 
LPHSLRAIWNKQLPKLYEKTLQPIEEEDIGYKDYVVSIEDDDNVDDGDQQEQMIIDEESY 
KTIGEKSTIELIGMYNNNKFGNEFIRIPLRETALHAQSLRYDTEAKFVNHKDSIPLFYEN 
STCTCKERLI DFSERQLQQLKQDGMDKPTDK 
(SEQ ID NO: 209) 

Top Blast Hits

                                                             Score     E
Sequences producing significant alignments:                  (bits)  Value

AAF02799 (AAF02799) F5I10.23 PROTEIN                           37    0.14
O23058 (O23058) BAC IG005I10                                   37    0.14
OPI1_YEAST (P21957) NEGATIVE REGULATOR OF PHOSPHOLIPID BIOS    34    0.90
Q24042 (Q24042) HYPOTHETICAL 89.2 KD PROTEIN                   34    1.2
Q24043 (Q24043) HYPOTHETICAL PROTEIN (FRAGMENT)                34    1.2
O76871 (O76871) EG:100G7.2 PROTEIN                             34    1.2

Comments:

TaqMan Primer/Probe Sets:

5'start=292
5'stop=313
3'start=375
3'stop=396

5'primer=GCGGCATTATCCCATTCTAGTT (residues 292 to 313 of SEQ ID NO: 208)
Tm5=58.17

3'primer=TTGTTTATTCCAAATGGCACGT (residues 375 to 396 of SEQ ID NO: 208)
Tm3=58.03

probe1=CTGAAGACGTATTGCCTC (residues 350 to 367 of SEQ ID NO: 208)
probe1start=350
probe1Stop=367
direction1=Forward
Tm1=68.97
score1=1.97
length=105

PCT/US00/28888

CT10000
Nucleotide 
Genomic coordinates: 
Start: 3143 

Stop: 6956 (SEQ ID NO: 210) 
Amino Acid 

MVLTLSCTTRRVASSKGNFSKEDAVLGNQFPILKKSNNLSIARPPSIESFSASVEKIFRE 
WNESGGEKI FDISQNEEEWMDI ISLVESVYEPVFSKSLKPDKLADKTCLTAAAFAALASA 
VDEKLTILSGSDGSVLQRTTKVMKKDPKKIAESLLNNEKWTSILLDRLKTAKKLLSRRGA 
LKSAERVEVLHRLNKLKEAPLPHHPSLFDNFSGGKTSAVSAGTVIAS DMHFKLVEHI FKV 
SFRKWGPCGDKTESGEEEDEEEEEEEKKHSISRFVLQFMNGHNGQHYHRPESASVYFCDY 
YDYLAYRNLPNEYKLSSMHPGTFNMEDLPFRPFAVPSTYKTELEYKRFVQSTNLPQLSFD 
YGEFLCYCIFGADWYKHLGDVVDSLENSSMISFDSQTLSGVYKNTANYKRLGKKRNGIAD 
LAVRSMAEFIRTEAHKALTAEEMEEEEEEEEAEEEAMDQE PAEVDFLSVPHLRRKIRQAV 
SVLNNFVENDLSILVSNFKNVLTDDTVSGTDTDNFGSSGEFEALSSHLFLSRILDEVHIL 
RNTDIQRTLFSTHVSLSDKSPPSRVRGSNVNFNNNAGNISSLQTYGGIEELPENVLVGLS 
GGFEDTDMYSGEDVVVVWDGCDGGKVLSVTFNCGDNFIQLHEKTAETFKDDTDLVERIRD 
VLQTASKTGNLNKKAYSRKNIYAVLRENGIERPGDDFTEKGIALKDKTNQPPPPARSAKI 
TVEGVKGFFSGFRDILETRALTTYSAETFRDLGQGIVKETEGLTAATVAETSFSEGLAES 
LRSDANLGLEFSEDAKTWFKNDTSRSLLEETRALRANNTSFSSFARDMGVQVSADLDAE 
FAAEMRETYPDAALEQNLKDLDKFEET I PESQVKKLKKI DS YLTEN PERAGKE INDTELS 
KATDSVLGKKLGNAVTVLMNNFGKVTIVVGASVVAGFLGPAAVALVHASRGAHLNWDHT 
SPKGVISYKIVDFSCADRNTGWAKPTKHPFREEIDHVIALDASFLTENGAYVFPEDGGPK 
SKYKAYAPICGTKDAAQGECGSWATFDDPHSVLPWVASMKDLPKGQSLSCDKGMSTLKAV 
SSVLLSIGKDVAEAIFEVAEDAVVGLASKAISAVINNPLFIFGVPLGFGIAATRLNPSNW 
KTGLIVFSILLVVILI VRFFAGSGPLTLNWFGAKNSAKRKQTEQFEDGGGNRSKIVLAEK 
DNANSKLQSRRNETGPMRLEELPGHEDLRPVFFPATTNYSKSAKILGYKSKPFNDFYTKI 
INTDIIKMDR 
(SEQ ID NO: 211)

Top Blast Hits

                                                             Score     E
Sequences producing significant alignments:                  (bits)  Value

YK06_YEAST (P36062) HYPOTHETICAL 75.5 KD PROTEIN IN SDH1-CI    43    0.010
Q12532 (Q12532) HYPOTHETICAL 119.1 KD PROTEIN YPL009C          43    0.017
O75184 (O75184) KIAA0702 PROTEIN                               41    0.050
O60721 (O60721) RETINAL ROD NA-CA+K EXCHANGER SPLICE VARIAN    41    0.050
O43485 (O43485) RETINAL ROD NA+/CA+, K+ EXCHANGER              41    0.050
AAD28522 (AAD28522) FLAGELLIN (FRAGMENT)                       41    0.050

Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 1 to 969 of CT10000: this
corresponds to nucleotides 5867 to 6835 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=2102
5'stop=2125
3'start=2185
3'stop=2204

5'primer=GGATTGCTCTCAAGGATAAAACAA (residues 2102 to 2125 of SEQ ID NO: 210)
Tm5=57.56

3'primer=ATGTCACGAAAACCGCTGAA (residues 2185 to 2204 of SEQ ID NO: 210)
Tm3=57.81

probe1=AAGTGCCAAGATAACGGT (residues 2148 to 2165 of SEQ ID NO: 210)
probe1start=2148
probe1Stop=2165
direction1=Forward
Tm1=68.99
score1=1.99
length=103

CT10001

Nucleotide

Genomic coordinates:

Start: 17098

Stop: 20342 (SEQ ID NO: 212)

Amino Acid

MGAPTNADFTRTVSGVASSLYLVNPGAPSDREKLVLASSYSDSFVYNYKDAVVTAEAPKW 
CPFNEPALHEH IMNRLEKAGLINRSRFVCNPVKSAGEVCGFRYSGGSTPQNLI FPIGASE 
INWLVILRNAARFGTVAASAKDAIERIPDLREGGTSKHVAKNAMRRLRVWRAFNWIAEAS 
RSAGM I R YEPFS VGCALYDHEVRRRQLKGSYERNVLFLGDH FCKESTLLADI SRGGRS S D 
FWTIVEAVIRYKNRHARTISNETNAIPEDSSINLEWEDVLVKNQRDTNQGDDSTLEKTLE 
AAIKEHESIGEKRKKHILEFIKTCLTEEQREMI FKGVGGKGNLSPAHLTNLADAILANNA 
KAGI W V IMQS LLKQ I N FS I LHL I GYEAQRLLM FKLYM P ALLALFI SQRG I G DV FLNG V FN 
LEVMKERAAN AKI RDMVSRDAYKNNTNESNNLGN YSQFDI AT I YGQMS DCN AI PLS IN I G 
VPLHKSRMNMQDIEKTIQHIVDMGLAKIRVAEENRVARRLLHRRKMQEKAARERAAARQR 
RLRGEEEEEEEEEEEEGEEMEEEEEEAGTSGVNGSGYDQEEEDDGEEEEEEEEEEDEEDS 
EGEDMNGENNSRKRKNTGNTSSTQQPPQKRQRGKNAPISTKGKRLKERDNIGGFLLATIQ 
N DDRQVN VE S I QKLLT ARQRKYVKGGKCGDNGLPELL I EKVTNLLDS V FHFRKGS ILNSI 
HANRRSETGVYTTKANCICDYYERNVSKDSSNNTPHSSECIERARERDASCAESNKRPCP 
VDSNNPEDVEQRMRELIMDPPSLSGVEDSLAIERVLQNEILFTSLVTNPIFNAVLGAEKG 
DLGRFIVLNNIVKFMNMTIACLVDGDMPMLLDSRGKTKNLLEKGTVKNTRKFFKPNMTAA 
ELNVATAQSAGHQYMNAGHCPEPGIKQMLLPDCIMKLKSIAMEKGRGGRSALHRQKCDHA 
FCKMLKCLFFN I DPSNAADTFI DPASRATLFRLDDLCRDRKKYKNI DWVKDLLDPVMKGT 
NKWVGTGEYTNI GRDSNVAAPVDFYT I LKYTMI DDGV I S VPSRKPNDVY YST I ERADDLL 
TESRDASCESYRPTLFDARAVLEVNGDGRVPYPSSETVEDLGGEEEEEEITGVI DDSTEI 
EDVQVQDSNLFDVELFDIPEIEQHQQGGEEEELPSAISEVFASLPADNDSSSPAHIPSFG 
NSEEGEKSPEPYNIFDSALDQLLDLIDSDGRNNNPKRVDWNSVTIQE 
(SEQ ID NO: 213) 



Top Blast Hits

                                                             Score     E
Sequences producing significant alignments:                  (bits)  Value

GARP_PLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN PRECURSOR       77    6e-13
O94922 (O94922) KIAA0835 PROTEIN                               76    2e-12
Q9YTL7 (Q9YTL7) ORF 48                                         75    3e-12
CENB_CRIGR (P48988) MAJOR CENTROMERE AUTOANTIGEN B (CENTROM    75    4e-12
Q07034 (Q07034) RNA BINDING PROTEIN                            73    2e-11
NAB3_YEAST (P38996) NUCLEAR POLYADENYLATED RNA-BINDING PROT    73    2e-11

Comments:

CT10002
Nucleotide 
Genomic coordinates: 
Start: 20783 

Stop: 23729 (SEQ ID NO: 214) 
Amino Acid 

MEETITLKESTGTVSPFKNENITRIASNYVRAFTDTWSHLVNISGAPLTAEKNPSAIPAN 

ELNRYWTKTNVLCN PLFKLE DH I TRDE DTGT ITLKFKMYI DDKNGLYQS AVLMLALDS FV 

SLASFSHGADLVSNKSENKFCVKIPHDTRAESLLNNVGFPAGLSGPFKRWSINYKAANLS 

GKSGIDGLSGSMLTVLKNNTNKRATDILHLVNNVSASAQQLDDSEMSRTFNHQKKVGVCY 

DINVSSSRQVNQRNLLHHQNIIGQHLIEFRTKQLERAQNKKVKEEENGEHEEMTSEEEEE 

EDEYEEGGCLSDIDEEDFYEDGYDEEEGDDNRTRKKKKMEEDEEDEEEEYDDEEDEEEAE 

TCGANGVIDCEDDAII FPNGQNSKRKKNGKKTNIKKRSRRKGECSANTLSFVEKYVGNCK 

SLGIKPVGCPPPSTEFTSLFMKGSEADSCYNTCQSTRGASRIRSLLNKYSVKDLMQVNSP 

SSWKWANPPDRRFVLFDKKTKEEVEVKFEIECEKSEYFDWSELPSNIKVWLKETAKIIK 

HLALIEDFLPAMGAATPKIPLNLIKTMTSIFSVRDIVGFKIPEEVLSFIPIEWKTSISAM 

GLLSVQFDRI IEVIDLMITNGAFATSCLNNAFFLERGVVPRDGSNTWLHTDLVQLSTSI F 

RSIRNRGVNIGGNNNTGSNSSSSSCGGNKGDYGVRCGLSISKRGITLKPPPAAMTNSSSP 

SSSAMISLPQPTRQSI DLSITTI IQDFSEVSGKLRLNGLQKNMSDKSKDVFNDAIYDSGA 

FKALLTCTVNDKSRRKRKRRTLLASGEGVVRRNLMVSQGNDVNDAHQFQEECGIKIGGGA 

SRVYKRAQRRGSAVSSRRRVRNKPQFTIAVSDEDDDCEEEGDFSSELNPTHSQLLLFQQR 

QQDSCTEDDDVLVSVEEYNNRVSGSSTTAGDRVLAKDLLSTVSPNEKRNSAALAALTISR 

HSLFNALSAKTKLGENGRFFL 

(SEQ ID NO: 215) 

Top Blast Hits

                                                             Score     E
Sequences producing significant alignments:                  (bits)  Value

Q9YTL7 (Q9YTL7) ORF 48                                         75    2e-12
GARP_PLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN PRECURSOR       75    3e-12
Q07034 (Q07034) RNA BINDING PROTEIN                            74    4e-12
NAB3_YEAST (P38996) NUCLEAR POLYADENYLATED RNA-BINDING PROT    74    4e-12
Q9YPA9 (Q9YPA9) HYPOTHETICAL 45.2 KD PROTEIN                   72    2e-11
O40947 (O40947) ORF 73                                         69    1e-10

Comments:

TaqMan Primer/Probe Sets:

5'start=1699
5'stop=1723
3'start=1773
3'stop=1797

5'primer=ATGACGAGCATTTTCTCTGTTAGAG (residues 1699 to 1723 of SEQ ID NO: 214)
Tm5=57.07

3'primer=TGCAGAAATAGATGTCTTCCATTCT (residues 1773 to 1797 of SEQ ID NO: 214)
Tm3=57.23

probe1=ACCAGAAGAAGTGCTCAG (residues 1743 to 1760 of SEQ ID NO: 214)
probe1start=1743
probe1Stop=1760
direction1=Reverse
Tm1=68.95
score1=1.95
length=99

CT610

Nucleotide

Genomic coordinates:
Start: 85707 

Stop: 83427 (SEQ ID NO: 216) 
Amino Acid 

MPPKHKPNTALKKHIIRNQQRKKEDDAESRFQRNMGQEVSKLDAPTSSKNRQRRKIRTSK 
ILSRSGDCVAGDCSDLENDEGKRDTDQEGGGRGGGNEEEEEGKEEGEGEEQQREEKEEQS 
EEKEEKDGEEEEEENVEDEHVTPTTSVSKRAKQMKKHIFPPSKKRKRSDTESKALAVPAG 
KMMTVSRPLRGAITSGSILGVRSENAPQYDYVSYLADEAWKEKAIQYRIRSLLANLLKA 
NKTKAFPTSSSLLSSEQGKKKFGGKRTNTFVVTNVGAELVKALLANSCWAISHRKDIRSG 
EIQWQELSSKILKSLNDGNATEINNLMSSIVEDRIQRTVKERVYFEQLATVCNNLFGTRI 
LPNKNFDKNFVSVASDNSNATVRGLSIPRYFRAINNNVWVKMSSTMDLLVGGGMRSKSEH 
SISMLEKCAAGVLARASARPVEKMIKSAVEETSQAFNLSTGVFVPKQQQQQRQQQQQQFP 
PFQPPPFPLPPPQAPFQVQQPTYQGYLNPYYQYNQYYNPYAPQQLQQQYPLYFLGNQSQP 
PPQLQQQQQQQQPPQPPNNIPPPPTPQQQSPSNIPPPQQQQQQPFPVQLISSPPPPPPIP 
NTAPSPPISRVRFDSRSTTPQPPPTPVLPKPTPLPPPSTARAEEENATDMSFTDIDSELG 
SIDFDLPPATPGRNVEEIIKAQRQAVKETGVRGEEEEEEEAFIAPIIRQPRTPGNFRDEL 
LDVNESIYGSDIEPAAAAAAFDWDMGLDDLNGDEPYEFE 
(SEQ ID NO: 217) 

Top Blast Hits

                                                             Score     E
Sequences producing significant alignments:                  (bits)  Value

P93794 (P93794) LOW-MOLECULAR-WEIGHT GLUTENIN STORAGE PROTE    84    3e-15
Q05573 (Q05573) OMEGA SECALIN PRECURSOR                        83    5e-15
AAD43602 (AAD43602) T3P18.1                                    83    8e-15
O04365 (O04365) OMEGA SECALIN                                  83    8e-15
O48809 (O48809) F2401.18                                       83    8e-15
O65375 (O65375) F12F1.9 PROTEIN                                82    1e-14
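
Each hit line ends with a bit score and an E value, so a line can be split from the right even though the description contains spaces. The following sketch parses a few of the CT610 hits and keeps those at or below an E-value cutoff. The function name and the cutoff of 5e-15 are illustrative assumptions, and the input lines are the cleaned forms of the entries above.

```python
def parse_hit(line: str):
    """Split one summary line into (description, bit score, E value)."""
    description, score, evalue = line.rsplit(None, 2)
    return description, int(score), float(evalue)


lines = [
    "P93794 (P93794) LOW-MOLECULAR-WEIGHT GLUTENIN STORAGE PROTE 84 3e-15",
    "Q05573 (Q05573) OMEGA SECALIN PRECURSOR 83 5e-15",
    "AAD43602 (AAD43602) T3P18.1 83 8e-15",
]

hits = [parse_hit(line) for line in lines]

# Hypothetical cutoff, chosen only to show the filtering step.
strong = [hit for hit in hits if hit[2] <= 5e-15]

for description, score, evalue in strong:
    print(description, score, evalue)
```

Splitting from the right avoids having to know where the truncated description ends, which matters because the summary lines cut descriptions at a fixed width.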

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 34 to 361 of CT610: this
corresponds to nucleotides 83630 to 83957 of the genomic reference sequence.



FIGURE 3, Sheet 116 of 160

WO 01/38351 PCT/US00/28888



CT611 

Nucleotide 

Genomic coordinates: 
Start: 88938 

Stop: 85761 (SEQ ID NO: 218) 
Amino Acid 

MEGVTSIVAAAVPEVAILITDLMGGRNNKRSTYERIVGIVGESGDLLEAILDICNRNSYR 

DELLEGETWINPTGLLKEISLLMKKALDMNIKMKSNDDPVPFTTLDQNEQEFIGHLKSC 

KKQDG PAYKDL I H RI YS GMFVMKNTRLMLDE 1 1 RGNAG DAVEE KN ALC E A Y AEM I S DM DL 

IRI FLLLVAI KRDQNKKHRHMKSVI YEDWVSLNTLKDV FHKEWYMW P PS ALQVGTKIRD 

ARTFSVLFGSDMHEGRNNDRIWENMAFSVTEAFLSGPSTNNHYNKGHLRMYAARPVYDAM 

EYVPQELHHILFGTKIAKMIDIVYRYSIYNVPYLLAADTERVEEPKKSVMSPSGLIISPN 

ASLLENTPLSLVSRHGIPSARKLGSFILEHENAENMHLEEAIAKCMVSQTLQEESWGESQ 

AAMVYQPSDEVEVIQAHVTKILSGNTTNKTCGLCYADLDMKPKFFNCSHENMKASYDYFP 

VHAFMDTFEARQETCSAKLCPDCTIKHLMYVYEKVSAGSEKLKDVFRCPCCGEYMVQFIG 

RCHEFSSLFERAILAGENVDPEYIAANKLLITELIKRAEKCFYTVELLQAEFMEMCKMDK 

DFALDKDSKFTVVDNRFRPPVKLFKMVEGETGDSKCSLICTQCLLPNVCDQPNEMEDIVT 

VDV P P PVLPY P PP EQLE DYY FQ DVE DAE FD DP PT DE LVR YDTG PGLH KW PMRLSCGFLAS 

N FVP PNEEVTNCRQAVS I LKRT PEKKI RGWN PES PEGKV LLALAN WH ST DRM PENMKGLL 

NDISVIHNTRERFQNRVKVHYLNSVFGGFDDRDFEQVVGVSI PLI ATYFYVYEKLNHESA 

LGLWAKMFVKNLI GEMVLERPECVFHRAHS FVLHCVDRRALSG IRPNQGAKME I VKQVNI 

VRQNMTSESIKDPVFTVDEKRTLEWKVEKEGQEIKTVKCPKCKTPNIKLGGCITMTCYDC 

SGRRDGYPTVFCWICEDEITNPDHILIDHKLLYSDCKSTKAALEKVYNCTLCCLALRKCS 

DSYLSKQRGGGEEEEIEIYVMEDGFEFDVHTKTAVPTK 

(SEQ ID NO: 219) 

Top Blast Hits

                                                             Score     E
Sequences producing significant alignments:                  (bits)  Value

AAF04637 (AAF04637) HYPOTHETICAL 84.4 KD PROTEIN             1495    0.0
BAA78677 (BAA78677) HFB30 PROTEIN                              44    0.006
AAD21842 (AAD21842) ANDROGEN RECEPTOR ASSOCIATED PROTEIN 54    44    0.006
O94793 (O94793) HRIHFB2038 PROTEIN (FRAGMENT)                  44    0.006
Q9XII0 (Q9XII0) F7H1.11 PROTEIN                                41    0.042
YKZ7_YEAST (P36113) HYPOTHETICAL 63.6 KD PROTEIN IN YPT52-G    41    0.056

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 829 of CT611: this
corresponds to nucleotides 85988 to 86816 of the genomic reference sequence.

Hit to public SBV sequence:

gi|6165655|gb|AF099142.1: CT nucleotides 1 to 2738 match nucleotides 8582 to 11319 of
the public sequence with a 99% homology, a score of 5398 and an Evalue of 0.0

CT612

Nucleotide 

Genomic coordinates: 

Start: 91607 

Stop: 89060 (SEQ ID NO: 220) 
Amino Acid 

MGSNQQQSFISKRNGTKQEISLEKIIKRIENACLPVNQYVPKLDKNAINPQELASHIMDR 

LPATISFQEMDDFLADYAKTKIVDHPDFGKLAGRFICSNIHKNTKEWNSFSATTQKLRHA 

IHPGTGKPASVVNDTYYENVMANAEILDAVIDYKMDYLFTCFGLRTLEYSxLIKIGSPTD 

RKKRILVERPQDMIMRVAVGIHGSDIKSVIETYDLMSRHYFTHASPTLFNCGTVTPQLSS 

CFLLGLQDDSIEGIYDTLKEAAIISKTAGGLGIHFHDLRAKGSPISSWSGTHPGLMAFLQ 

IFNVSVKKVSQGGDKRRGAAAI YISDWHLDVKDFIDCRKNAGNEDLRTRDLFPAIWVSDL 

FMERVKAGKNWSLMCPHECPGLSDVHGEEFKALYEKYEAEGKGKEVVKARALFDQINSAR 

IETGTPYVCFKDTINRKSNQENVGIIKSSNLCTEIVQYSDSEETAVCNLASIAVNKFVKY 

SPIPSLRPYVDYREMKRVVKIMTRNLDKVIDVNFYAVDKTRISNMKTRPMGLGVQGLADL 

FFKLRIPFESEEAALINKRIFETIYYGALEASCEIAKEKGETYELFEGSPLSKGIFQFDM 

GKEN I KNRDIY FNSLP I HDWEQLRRD IMKYGVHNSMFVAPMPTASTAQI LGNSES FE PLT 

SNMYNRNVLSGSFQVVNEYVIRELIKLGEWNSVTKQRIMASGGSIQTLPNIPKSTKELFK 

TVWEINPRTTLDMAIQRGMFVDQAQSLNLFVEEPELSKVRSMTMYAWEKGIKTLYYLRTK 

GAARAVQFTVDKNVLQEVKKEAPSPVAAFSAPVREEEEEKKSSIWPDPAAALLCSINNP 

GACEMCSS 

(SEQ ID NO: 221) 

Top Blast Hits

                                                             Score     E
Sequences producing significant alignments:                  (bits)  Value

AAF04636 (AAF04636) LARGE SUBUNIT OF RIBONUCLEOTIDE REDUCTA  1720    0.0
RIR1_SCHPO (P36602) RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE LA   807    0.0
RIR1_MOUSE (P07742) RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE M1   804    0.0
RIR1_HUMAN (P23921) RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE M1   803    0.0
AAD37491 (AAD37491) RIBONUCLEOTIDE REDUCTASE M1 SUBUNIT       801    0.0
RIR1_YEAST (P21524) RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE LA   796    0.0

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 1 to 845 of CT612: this
corresponds to nucleotides 89225 to 90069 of the genomic reference sequence.

Hit to public SBV sequence:

gi|6165655|gb|AF099142.1: CT nucleotides 1 to 2547 match nucleotides 5913 to 8459 of
the public sequence with a 100% homology, a score of 5049 and an Evalue of 0.0

TaqMan Primer/Probe Sets:

5'start=1327
5'stop=1348
3'start=1383
3'stop=1402

5'primer=GTCGGCATCATCAAGTCTTCAA (residues 1327 to 1348 of SEQ ID NO: 220)
Tm5=58.78

3'primer=TGCACACTGCAGTTTCCTCC (residues 1383 to 1402 of SEQ ID NO: 220)
Tm3=58.45

probe1=TGTCCAGTACAGTGATTC (residues 1365 to 1382 of SEQ ID NO: 220)
probe1start=1365
probe1Stop=1382
direction1=Forward
Tm1=69.00
score1=1.99
length=76

CT613

Nucleotide 

Genomic coordinates: 

Start: 94397 

Stop: 94175 (SEQ ID NO: 222) 
Amino Acid 

MVIRLCFLESITCFVYGIMAPLSLDTNTDYLSHKKDTNKKIQMQINFIPYSNMHVYIAGV 

YTFHEKKGLTYQQY 

(SEQ ID NO: 223)

Top Blast Hits

                                                             Score     E
Sequences producing significant alignments:                  (bits)  Value

O96794 (O96794) AMINOPEPTIDASE P                               29    4.1
YA2G_SCHPO (Q09706) HYPOTHETICAL 157.7 KD PROTEIN C2F7.16C     29    4.1
O24567 (O24567) ESR3G2                                         29    5.3
AAC36183 (AAC36183) PUTATIVE PEROXIDASE                        28    9.1

Comments:

Hit to public SBV sequence:

gi|6165655|gb|AF099142.1: CT nucleotides 1 to 222 match nucleotides 3447 to 3668
of the public sequence with a 93% homology, a score of 357 and an Evalue of
1e-101

TaqMan Primer/Probe Sets:

5'start=48
5'stop=65
3'start=169
3'stop=194

5'primer=TGGCATCATGGCACCACT (residues 48 to 65 of SEQ ID NO: 222)
Tm5=57.92

3'primer=TCATGAAATGTGTACACACCTGCTAT (residues 169 to 194 of SEQ ID NO: 222)
Tm3=58.55

probe1=CTTTGGACACAAATACCGA (residues 68 to 86 of SEQ ID NO: 222)
probe1start=68
probe1Stop=86
direction1=Reverse
Tm1=68.82
score1=1.22
length=147

CT614

Nucleotide 

Genomic coordinates: 

Start: 129006 

Stop: 127158 (SEQ ID NO: 224) 



Amino Acid 

MEYIGEKNNNPVSNESVSEKELKLRSSFLMIGKKTSKYEQVMGVYEAIESIRQSELSEDT 
FWHVKKDKQLKFARGLKRLQELVEDDSLRIERISCAPPEPGHLFKDDAGHVTDEEWLAT 
QEEDVRKINTIVKEKLKRKDKDFKFSQLYRYMSNSLSEAVEKKHDCMI ISSDFLIGLGFS 
TMNVTHALKSMERTMQKHGFKDMMVPLVEICHRTHYKGEYIANPIFKSHSSHCLIVPLFM 
VAGVFARSAHPSAASIEMYLSTLAYAVILYSDEKQRQIREELARKNLQIKEELENQVEKT 
TKVEKELETQVVKTTKVEKELETQVVKKEEYKNSYIETEQLFKVSEEQKESLRNVHKKSS 
NATFRYDSGSCLVFSISSTEFYLLCRTDKSGSFETATENGLRYIFSPINKKRDTAGMRPR 
LIMAVTGCDAPIACNDSIKHQNKFKVLKCNRSSIVFQTPPS DEDLKGIVQKVTGSDIRI F 
MNDGTVYQDGQRIDISSPQELDEENMTQFEIEQQRKLHSMMENTSKIVTRYNKERHLTTK 
EARTRNKTEKWFEKVKKREEQKKRENGEQSTSEQEQRGVKRTWENDNEFDSDVEEEEDGN 
NTQEQQRVKRHAISV 
(SEQ ID NO: 225) 

Top Blast Hits

                                                             Score     E
Sequences producing significant alignments:                  (bits)  Value

Q06166 (Q06166) MATURE PARASITE-INFECTED ERYTHROCYTE SURFAC    54    2e-06
O66878 (O66878) CHROMOSOME ASSEMBLY PROTEIN HOMOLOG            53    4e-06
GLE1_YEAST (Q12315) RNA EXPORT FACTOR GLE1                     49    8e-05
O35788 (O35788) CYCLIC NUCLEOTIDE-GATED CHANNEL BETA SUBUNI    48    2e-04
Q14789 (Q14789) GIANTIN (GCP372) (MACROGOLGIN) (GOLGI AUTOA    48    2e-04
Q9ZU69 (Q9ZU69) PUTATIVE VICILIN STORAGE PROTEIN (GLOBULIN-    47    3e-04

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 2 to 976 of CT614: this
corresponds to nucleotides 127217 to 128191 of the genomic reference sequence.

CT615

Nucleotide
Genomic coordinates:
Start: 130290 

Stop: 129405 (SEQ ID NO: 226) 
Amino Acid 

MEYIGEQKLINLLDETPEEDELQLRSSFLMIGEKQYEKYEEVMSTFEAVETIRKSEFRDG 
VFIVQLKENKHITFEGGLKELRELTGDNSLKIESLLSSIKPEKGHVILKNTSTTTDDEWL 
ASQDKDVQEVNKLVKEKTRMLFRGFYFSPTYRYITKSLPQI PFGEKERFVVSTDFLIGLG 
FSADDVMEKLIAIEGNMRKSGLKYTWVPVAEVCHLKKYKGDIVVNPIFKSYHSHCLVIPL 
VYLGYMFSRNVQPPSLEVETYLLALAFAIDLYGREEMRKSCMRLCEDISEVKRG 
(SEQ ID NO: 227) 

Top Blast Hits

                                                             Score     E
Sequences producing significant alignments:                  (bits)  Value

Q13779 (Q13779) APOLIPOPROTEIN B-48 (FRAGMENT)                 36    0.34
APB_HUMAN (P04114) APOLIPOPROTEIN B-100 PRECURSOR (APO B-10    36    0.34
Q13788 (Q13788) APOLIPOPROTEIN B-100 (FRAGMENT)                36    0.34
P96470 (P96470) IGA-SPECIFIC METALLOENDOPEPTIDASE PRECURSOR    36    0.44
O28789 (O28789) SIGNAL-TRANSDUCING HISTIDINE KINASE            35    0.58
CAB55172 (CAB55172) HYPOTHETICAL 77.9 KD PROTEIN               35    0.76

Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 1 to 725 of CT615: this
corresponds to nucleotides 129499 to 130223 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=296
5'stop=318
3'start=360
3'stop=383

5'primer=TTAAGCCTGAGAAAGGACACGTT (residues 296 to 318 of SEQ ID NO: 226)
Tm5=58.14

3'primer=TGTACGTCTTTGTCTTGAGAAGCA (residues 360 to 383 of SEQ ID NO: 226)
Tm3=57.87

probe1=CTACTGATGACGAGTGGC (residues 341 to 358 of SEQ ID NO: 226)
probe1start=341
probe1Stop=358
direction1=Forward
Tm1=68.84
score1=1.84
length=88

CT616

Nucleotide 

Genomic coordinates: 

Start: 147517 

Stop: 144748 (SEQ ID NO: 228) 
Amino Acid 

MESIKEQQQQQPTVTFSEEPDQVYEFEDTTTTSAKKPTPSKAKFAAGRRMVSKQRRNTIR 

SPHTETVEEVVGEEEEQQQQTPPEITPAEKKQQSLQELDALMGKVPAHLDVSVLAKSVAE 

FLENDEDEDEELEKNKKAQKSVLFNSVMNSGRTELSPSTFCDGCVSKVKSAFEGKDLVSN 

IVKVEGEAVKKTAIATDTTKLANLFLGCMNLQFHEHVTIETLNKKALDKGGPLFTLKLSD 

AVYVDEMDLEKKRQIFGSNGDKSLFKELGGNYIDSAIKSTGLVMSTPSSSSTKKAGTHFK 

TTNQIVEESVTESMRNGCCCFKNDKWLAKRESNLKSLNNTV FGEEDDEKSAYAYSDSEDE 

DEDENEEEVDYDYNNETIESSVGNVIKNLIRKTIGLSDVEEEKEEGEQSEEEEEDSDDDD 

DDASSVCSSSSSSSSVTWAAAEEEEEEDEEDKDKDTATWEDEDDKESVISSSSEDSEE 

DEDDDGATSQCSEVVFGDVTECEFDESDGNPLYLASDNSFRPSASVTKYPQSEEEMDVSL 

LSKNRSTPVCLSLCRHSSGCITNSFNMSTILKSLKLFPAGTEAAEDCVHIESTKKKDEDE 

DEE DQGLDLQNSQYYS VLVDVDNLI I FSMGSTTYESSMVEV DY DKS FWSSFDKS VKPYCE 

SKKSALINALCEDNVTAKVYATVHTLAIPFCESMPINHINNTTPYGSYKTFRISLPGNFS 

GQHNDINNNWRSDMYTKMVENLLKREVVENKTHSRRYVRNLIVDGGVGENSGNYLKVHEN 

NEDI FGSIEANSMSAKTAAAAFKNVAKKCDLIQTTTNDILTGPFKQYLI DYKYNSARKNI 

IMEPCEGDETTAHEMKRAQDAYKQALHRAKITASSISLRGIWHEMITRDMNTTYNSMFMY 

IPDFYKYVQVSPVNVSPLYMLD 

(SEQ ID NO: 229) 

Top Blast Hits

                                                             Score     E
Sequences producing significant alignments:                  (bits)  Value

Q9YTL7 (Q9YTL7) ORF 48                                         85    1e-15
AAD56625 (AAD56625) NUCLEOLIN-RELATED PROTEIN NRP              77    6e-13
NUCL_CHICK (P15771) NUCLEOLIN (PROTEIN C23)                    76    1e-12
GARP_PLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN PRECURSOR       75    2e-12
IE68_PRVKA (P24827) IMMEDIATE-EARLY PROTEIN RSP40              75    2e-12
NUCL_MESAU (P08199) NUCLEOLIN (PROTEIN C23)                    75    3e-12

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 476 to 1 of CT616: this
corresponds to nucleotides 145115 to 145590 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=1507
5'stop=1526
3'start=1554
3'stop=1575

5'primer=GAATTCGACGAGAGCGATGG (residues 1507 to 1526 of SEQ ID NO: 228)
Tm5=59.36

3'primer=TGAAGCAGATGGTCTGAAGCTG (residues 1554 to 1575 of SEQ ID NO: 228)
Tm3=59.21

probe1=CTACCTGGCTTCAGACAA (residues 1536 to 1553 of SEQ ID NO: 228)
probe1start=1536
probe1Stop=1553
direction1=Forward
Tm1=69.02
score1=1.97
length=69

CT617

Nucleotide 
Genomic coordinates: 
Start: 148612 

Stop: 147766 (SEQ ID NO: 230) 
Amino Acid 

MSPVI SQQSS PSATSTAAARI I STANLRVLGVKNKEEKDEEEEQQEVEPEI IEPATDFE I 
PFSPALTICI YINANRIHINSKGVCLNRKKIKPTSTINKNQDVPPELANASSYLVQTEHV 
TDKFLSSHCSICNYNVNDGEYKSALSTTRNGDQPLMRKSVRYVPLNEDNVVVQKGTYYGT 
TFIPEKTGRRI LWFSHYKKSPRPITAKLCCLLETINSFNGSCSSSSSASSSSNAPGPIEE 
FQVSSSI FFKKEECCPLQMKWVEQNELDAESPVLVLLMLAL 
(SEQ ID NO: 231) 

Top Blast Hits

                                                             Score     E
Sequences producing significant alignments:                  (bits)  Value

Q9Z2U2 (Q9Z2U2) 2N-15 TRANSCRIPTION FACTOR                     36    0.24
Q20150 (Q20150) F38B7.5 PROTEIN                                35    0.70
O94002 (O94002) SEC12 HOMOLOGUE                                34    1.2
Q20497 (Q20497) F47A4.2 PROTEIN                                34    1.6
O13779 (O13779) HYPOTHETICAL 71.2 KD MEMBRANE PROTEIN C17G6    33    2.1
P93002 (P93002) REGULATORY PROTEIN NPR1                        32    3.5

Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 1 to 741 of CT617: this
corresponds to nucleotides 147819 to 148559 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=363
5'stop=383
3'start=446
3'stop=467

5'primer=CGACAAGTTCCTTTCATCCCA (residues 363 to 383 of SEQ ID NO: 230)
Tm5=58.62

3'primer=ATCAAGGGCTGATCTCCATTTC (residues 446 to 467 of SEQ ID NO: 230)
Tm3=58.15

probe1=CGGCTCTAAGCACAACAA (residues 428 to 445 of SEQ ID NO: 230)
probe1start=428
probe1Stop=445
direction1=Reverse
Tm1=68.92
score1=1.92
length=105

CT618

Nucleotide 

Genomic coordinates: 

Start: 150145 

Stop: 148675 (SEQ ID NO: 232) 



Amino Acid 

MESVRDVKFYTFT4NVLAEKAKKIQRLNKDKGWRTSINAEIGYGGARLMDVRFTGRKSMDE 

LARCLYNCDGEYTTLRLVGSSAGNIIVYSLAFIMGIRGECCGFNVNNRLRMGKIIDRELF 

YKITGLNFPETVKCTCDGVRAICDLFLEVAALQEHPAWHETKEVGKKQQQHFNEFGSQYP 

GTKFNKRHKLSTKIIQQMFSEEKTMEQVLAFSEGTAASGFSDLYVEAPIQYVVNMYRAIS 

NMEGRVGAMYNLSRVLILLCSRWEKKPGYKNDFYSKCEMYIGSKKIVDDESFIFTDLITG 

DLVPLVRLAPSNEDI QRDVI RFNDST DI LMDS I DVRDVVLPVLSKI I WQNVS ARLKLRNN 

KSLSKLAKWKWNGMVSTHDNFDSNDYVIEHKRQLAADIMSDSLSKNHLPNFSKTITEYDE 

KENKTTPLICWNYIFELSPMGKHLFPLEEVCGFYEASLPLITPWQLKVVQKKRGRQMVIY 

GPRKRPRTQ 

(SEQ ID NO: 233) 

Top Blast Hits

                                                             Score     E
Sequences producing significant alignments:                  (bits)  Value

Q64568 (Q64568) ATPASE (PLASMA MEMBRANE CALCIUM ATPASE)        37    0.28
ATC3_HUMAN (Q16720) CALCIUM-TRANSPORTING ATPASE PLASMA MEMB    33    5.4
Q12019 (Q12019) HYPOTHETICAL 559.3 KD PROTEIN                  32    7.0

Comments:

TaqMan Primer/Probe Sets:

5'start=676
5'stop=701
3'start=823
3'stop=849

5'primer=GAAGCACCTATACAATACGTGGTTAA (residues 676 to 701 of SEQ ID NO: 233)
Tm5=56.76

3'primer=GGAGCCTATGTACATCTCACACTTACT (residues 823 to 849 of SEQ ID NO: 233)
Tm3=57.26

probe1=AGAGTAGGTGCCATGTAT (residues 733 to 750 of SEQ ID NO: 233)
probe1start=733
probe1Stop=750
direction1=Forward
Tm1=69.06
score1=1.93
length=174

CT619

Nucleotide 

Genomic coordinates: 

Start: 169814 

Stop: 165116 (SEQ ID NO: 234) 
Amino Acid 

MSTTQTQTIERPLPGKNNEDNSRLACLLAEGLQQQQQQQDGDSEISLPLVNAGTFACYDS 
TLANLTEGRLGSETENAKIRVKIHPSVFIIETNKEMTIEEISTKSLNALVEKRAREARRF 
S SLTEQKFPRGGGGCYSRKNERFI EGEI NN I KLNMEETASSLERLAGLLPWI N I KDWTM 
H DEKE IRLDLKGNDGMEELVN I SHLNQEEWEMERLSS S I VLKDAYGVFYAHHGI LDI VLT 
TSRFTGKLLQHPVIFRLMDVKVWINTPLQIAFPDTSKNPNAKKILYQHPSLTRLRDLNDM 
ASNSKSVSSIIIPELSKFNSTEFGMHYFTAQCFFGKNTNSLKDLVTRYYQLSFKNKPQPK 
LYEPTATATAASSSSSTASLTTEQKEKIAQSILSSKGKSLGDVSSTLSKEYDENRKRTKR 
QKTSTDTNIVPSGAPTSISMKNPVTCFFGPQYTSIMDCISEKTDWIEMHLFLTSLNDAEH 
NKTLVVDRKSNVSEIHDSGRFLTFGQNNTTAFIPDVDIPTLKLILRDDSGESSAAIIASL 
IYYNNVNLEGREFSNVSDAVVGLFSGGSAITVGDIAREIASIYNIGRESNCDSILFPGEP 
ILAGRRSYGRQYRWYDPINCVVGLYRSCLETMTRNIMRGQPVKVDETAWMYMHQQVLQVV 
LLPFFDCVLKSGVWAVKEARQLTDYIVREVLLKYTADPDQHKFLLFKKPVMDLIAKIVTH 
YAVIHSAADNGGVCLAFPRDPPFIVENDTSLRYYTLTDTPQSILNGDNVAENLKSATSVA 
SSPSSSSRYSSETPIRVVNLPVPTGRFLKMNKDLELFINVPLISSKEQKQQQQQTTATAP 
FSSETISKSFLNYVPPKSLTRNVTYGQNIAEDGFLGLKNKGELVSYFKVVKNTERDDGIK 
DMEIGDINNHQDDTGSLSSSSSSFVDGVRTSFSVDGKIEHVSAFLPGTTSQPTNLPVHAS 
KQVKYSVKELGMETVFFEPLLSSAVLYEASKTKSTQHLSPMRIYKECVSPLSTGRIDIFP 
SKVGTVAGTGFEFIWKVLQYDTGLPTTLERLSPKIPSVPISGEDSKMEVIAESGKGVQNI 
IAIAADQLRGSNNIVGGGTRRAIQQQQQQQQQETQAVVPVNVPARFEPTFTEIELFLQNK 
FRNVIATIISRMMMLVSNEEMKIIKEVCEHVSHIMVDGLYVALDPRKAIEEILERITAEQ 
NGITIDTGNEGYGSLRYASSGRLFINDEASEEAAAAIGGGGALGTGRRVPVELRSILDKL 
NTIGSTTQQQQQQQRQQRQANNNNTVPEDIKVHNEQMQKIRDSSLFTSKLLNYIRDDGRK 
DRIKTNISETLKKYSRIPSYFIASKAQKPIPWKHTKDNINLNKIPEDLNFSPAQNLFVPV 
NPRHILTDMQWLNCISIIETATRDSAIVMQSFQEQADKTTTQLEELLSQWNNIVSQVTDE 
KS PA Y VS SVKLE WLNNE ASR I AAI REN S EKSK I VMGVQGKI VN I DELG I V AVAR S I V DV D 
FYIKMPNVWASRDWKNLIYYAVNIAATPLINNISRGIMAASQTSVLYDSSLALIAAEQAT 
RNITM 

(SEQ ID NO: 235) 
Top Blast Hits

                                                             Score     E
Sequences producing significant alignments:                  (bits)  Value

ST20_YEAST (Q03497) SERINE/THREONINE-PROTEIN KINASE STE20 (    46    0.001
YN23_YEAST (P53832) HYPOTHETICAL 52.3 KD PROTEIN IN MRPL10-    41    0.047
Q24523 (Q24523) BUNCHED PROTEIN, CLASS 2 ISOFORM (SHORTSIGH    40    0.11
NIT4_NEUCR (P28349) NITROGEN ASSIMILATION TRANSCRIPTION FAC    39    0.24
O62235 (O62235) F36F2.3 PROTEIN                                39    0.31
MYSG_CHICK (P10587) MYOSIN HEAVY CHAIN, GIZZARD SMOOTH MUSC    38    0.53

Comments:

EST confirmation of the predicted transcript:

An isolated EST has sequence identity to nucleotides 234 to 1099 of CT619: this
corresponds to nucleotides 165434 to 166299 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=1920
5'stop=1941
3'start=2022
3'stop=2045

5'primer=GCCAGTGAAAGTGGATGAGACC (residues 1920 to 1941 of SEQ ID NO: 234)
Tm5=59.57

3'primer=AGTTGTCTAGCCTCCTTTACAGCC (residues 2022 to 2045 of SEQ ID NO: 234)
Tm3=58.24

probe1=CTTGGATGTACATGCACC (residues 1943 to 1960 of SEQ ID NO: 234)
probe1start=1943
probe1Stop=1960
direction1=Forward
Tm1=69.03
score1=1.96
length=126



FIGURE 3, Sheet 126 of 160 






CT1020 
Nucleotide 
Genomic coordinates: 
Start: 62990

Stop: 63659 (SEQ ID NO: 236)
Amino Acid 

MEWINQRTSREDLFNTYTGNAVIRSAAKQALAIEKHAAERRGEKAWTTSAAAAASSNFNN 
VQQDYT DDDI TQVS I ANSVLNN PFLKRYAKLI DNLAI SSLPPDI EDDV 1 1 HTR DASNSTV 
RVDGANIYFAIIDGDLCVYPKQYISDKVLCGSLNREKALFYNSSKNKWTYGCNLNFDIVD 
AAIMKHPDYKEETTSTKHIRKILGIGASEKLNITHYLNYFIQ 
(SEQ ID NO: 237) 

Top Blast Hits

                                                             Score     E
Sequences producing significant alignments:                  (bits)  Value

P75338 (P75338) MG307 HOMOLOG                                  32    2.7
Q80990 (Q80990) LATE MINOR CAPSID PROTEIN L2 (FRAGMENT)        31    7.9
Q18668 (Q18668) HYPOTHETICAL PROTEIN C47D12.2                  31    7.9
Q80991 (Q80991) LATE MINOR CAPSID PROTEIN L2 (FRAGMENT)        31    7.9
Q80985 (Q80985) LATE MINOR CAPSID PROTEIN L2 (FRAGMENT)        31    7.9
Q02689 (Q02689) HYPOTHETICAL 45.6 KD PROTEIN IN COI INTRON     31    7.9

Comments:

TaqMan Primer/Probe Sets:

5'start=331
5'stop=354
3'start=379
3'stop=402

5'primer=CACACTAGAGATGCCTCCAACTCT (residues 331 to 354 of SEQ ID NO: 236)
Tm5=57.74

3'primer=ACCGTCAATTATGGCGAAATAGAT (residues 379 to 402 of SEQ ID NO: 236)
Tm3=58.84

probe1=CAGTCAGAGTAGATGGAG (residues 356 to 373 of SEQ ID NO: 236)
probe1start=356
probe1Stop=373
direction1=Forward
Tm1=69.01
score1=1.98
length=72

CT1021

Nucleotide 

Genomic coordinates: 

Start: 69264 

Stop: 76212 (SEQ ID NO: 238) 



Amino Acid 

MSDTGQMEENRPATQKRRPGDEEEEETGSSNVPYYANFGDDATYSMYTGEGKRGKFVLEP 
PKERSVQRVQKPPKEKEEREQRSNVRTRRPGQEFEQKVLQDRSRERSEKU3QNLAEKGLQ 
ERQKKYTPKVAQTMTKKIIRFREGGRKFKAPQQQTSDKGAATNVLEREEIEMAAEREQPV 
EITGDTILGGLGEEDDE DMGEDELTIQHSSMAVSQPVQQIWSSPIPPKPTRPAPDIPIQ 
EDIVGKNISQLPPLPLDDYEDEEDEHLYEEVNDFLVAPPTAAAAASTRPPRPNIPPPPPP 
VVAVADETLKNLASIAALEKEAEEQRAAAVEREREVEEQRAAAAAAAAAAAAQREADEKR 
EREAEEQRAAAAAAAAAAAQREADEKKEREVEEQRAAAAAAAAAAAQREADEKREREVEE 
QRAAAAAEREILAQQLQEMKEQMRIKEEERRKELADKEEEKRRELAAKEEEKRQEILAKE 
EQLEKLNFQLGTEITSKRALEQMLEEEKASRSRSAASAQLAIQAIEYEDELPQAVEPQGQ 
LVPMDTDLYGKMYDLNKKLEVQNNTLTSAFEDVNKTNEQNQLVAQSLEKSAKAIEKLTSQ 
KHLPVDDPAFMQRIITERDFSLKNLGNVYKRVLGVYFTLKRDLFKSKALITDKESRDLEV 
RLTDVSTDLRANDLNTI LERLDVSVN I RSGGTLYTKFTEADTALADQVPSRI E I SNRSRS 
ALLPFSSAGLDTNFTNSSDKYNEIVNQLSSINEAMNILKENIVPTLNQIKI DVTNLLTVS 
SSRQYAIEERVYSDVSRMDSEIRKFLAIMNSKI SPYFKGDWTDERQRS IADS I SSQIKSN 
DKIKESVATLHDINTTSRIRSNPLLHKSSVLSSPDFLNAVNDFRNFLDIQGGSQFTYDVL 
SGQN I DDLS LAS KTTEKVTELCLELS I ILDV I HKNALSLNLPAI TY PAGET SMEESG SLA 
VDIRQEIGKNISDSSAELSRTLSEALQIFQQQQQQQQQQFQQQLLQQQQDQQNQQQLLQQ 
QIEEQQRVQEQQQQQQRDQQQQEQQQREQQQQQQQREQQQQREQQQQQQQQQSDQFRQQL 
LQQQQQFQQLLQQQGRRRGGDDGDEEREEREEGAEKDDCVRKVAESVATKYTADLTTLFQ 
REENN FQSKIASAKLGTLVFATPPSPIMNLTKLREEYSTFTTQCFSKLTAENNSIMRIFP 
ERIVEVCKSKNLNLMGKYLYIITTAQTEMEDRVKNILSGIFNQIEEFSNNVKQQQQQQAA 
SASSTNPPPPSTPSTTPPVTSMQVCELDDQRTLEKAAIVEAITLANAVLQTTKSASAPST 
AAEREIALKLENGKTSIRMEKVDLSSGATGVSDQQKWIDESTSKQELEDFIAEENFVETA 
HNEMDIGLILDAKKNDPTRDANLRLVKPHGINVQSFPYYVLRTWLGETDILDEDTVHPEY 
FRQYIDRNWKVEEHEREDTLKALGVSLSDTLAHIKDYYSPSVKNDASKSVPFALNTLLYN 
IFAIDGGMISSLSRTAFIYRKFLRQSMTDKEVAQGPVRSQLCEATIASLFTACSNLLRSS 
PLADKVEPRLQEKL7VAAAAVDTSTGDMFRIRVCHLMYNFIVAYVNLCNNRINYTLNVLRA 
SGLANKKVVAGKTTKGHTSSSHRFGSYDVTYDFSVLYKILQLQKQNISLLLEKGFNAWES 
CVAAMAAFTADPSLS I S DADQS I LFPLEGGE I VIEKHENDAEKNVDMVQELWKETALTLM 
AKELNSYYNWFIYSKDTDMEKLARVCRMIIGIVKAVLRLTNKAESLVDTNALSDIFKLPV 
IPIDDTKTLAINIVVFTLNNVIKPWMVSFKQMFRQKDGGVSSAYFSFQNIQQQKHQQTAS 
ILDAWACAPGKLTKAAHVFISGYENHIKLKKDDLLWGASMKFPADGRGTVVEGWAQQYNN 
ESVLEDFTDFSIEVNAPASGLLIPPDPLLSSMFGKGNGGSSSSSSKDNTIIGKGGLILNR 
QVVGQEQAPPINTSSDTKKIRRDANIEPIIGTPYSVIKASKGVSISVLDDFNEDSPEDFA 
LKTS I IN DAI REIGQRMT YTRP I FDHQTQKN I HYSS PKI I LEGSDLKNGQRSGQSWAPSS 
SSLTLASDWNLPSLELLYRELATKQVEKEEEEKSEREEDKGQKLNEKLSFVVNKAIGTIQ 
QQHQYSERGGGMKRYQQHSADQASNGGIDDIELMNSKDATSMRBCAKLALAVTNKIAAAAA 
RDGENSSAKPSNFGNRLDEAINPGALLLRRGGGVRGGQTPQSSMLTMFRPGQTGGNSSWW 
TTNTPLIQRTTSVGNNLVVLVPNLLDSHPPTFN 
(SEQ ID NO: 239) 



Top Blast Hits 

                                             Score   E
Sequences producing significant alignments:  (bits)  Value

O76853 (O76853) SRF RELATED PROTEIN  136  2e-30
AAD50121 (AAD50121) ADENYLYL CYCLASE  131  4e-29
O77033 (O77033) TRFA  120  8e-26
AAD46501 (AAD46501) LATENT NUCLEAR ANTIGEN  120  1e-25
O88542 (O88542) OPA-CONTAINING PROTEIN 1  118  4e-25
Q62006 (Q62006) OPA REPEAT (FRAGMENT)  118  5e-25

Comments:

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 462 of CT1021: this
corresponds to nucleotides 75184 to 75645 of the genomic reference sequence.

TaqMan Primer/Probe Sets: 

5'start=3774
5'stop=3799
3'start=3851
3'stop=3872

5'primer=AGCTGCTTCAGCTTCTTCTACTAATC (residues 3774 to 3799 of SEQ ID NO: 238)
Tm5=57.20

3'primer=CGTTGATCATCCAACTCACAAA (residues 3851 to 3872 of SEQ ID NO: 238)
Tm3=57.20

probe1=CTGTTACAAGCATGCAAG (residues 3833 to 3850 of SEQ ID NO: 238)
probe1start=3833
probe1Stop=3850
direction1=Forward
Tm1=69.00
score1=1.99
length=99
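
The coordinates in a primer/probe set can be cross-checked arithmetically. The Python sketch below assumes, since the listing does not state it explicitly, that each start/stop pair is an inclusive 1-based range and that `length` is the inclusive span from 5'start to 3'stop. The function names and dictionary keys are illustrative only and are not part of the source.

```python
def span(start: int, stop: int) -> int:
    """Length of an inclusive, 1-based coordinate range."""
    if stop < start:
        raise ValueError(f"stop {stop} precedes start {start}")
    return stop - start + 1


def check_set(s: dict) -> list:
    """Return a list of inconsistencies found in one primer/probe set."""
    problems = []
    # Each oligo's coordinate span should equal the length of its sequence.
    for label, seq_key, start_key, stop_key in [
        ("5' primer", "primer5", "start5", "stop5"),
        ("3' primer", "primer3", "start3", "stop3"),
        ("probe", "probe1", "probe1start", "probe1stop"),
    ]:
        if span(s[start_key], s[stop_key]) != len(s[seq_key]):
            problems.append(f"{label}: coordinates do not match sequence length")
    # Assumed definition: amplicon length runs from 5'start through 3'stop.
    if span(s["start5"], s["stop3"]) != s["length"]:
        problems.append("length: does not equal 3'stop - 5'start + 1")
    return problems


# Values transcribed from the CT1021 set above (SEQ ID NO: 238).
ct1021 = {
    "start5": 3774, "stop5": 3799,
    "start3": 3851, "stop3": 3872,
    "primer5": "AGCTGCTTCAGCTTCTTCTACTAATC",
    "primer3": "CGTTGATCATCCAACTCACAAA",
    "probe1": "CTGTTACAAGCATGCAAG",
    "probe1start": 3833, "probe1stop": 3850,
    "length": 99,
}

print(check_set(ct1021))  # prints []
```

The same check can be run on any other set in this listing by substituting its values; an empty list means the coordinates, sequence lengths and reported `length` agree under the stated assumptions.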
CT1022

Nucleotide 

Genomic coordinates: 

Start: 79064 

Stop: 83375 (SEQ ID NO: 240) 
Amino Acid 

MDFEGTTSSTPSKMSQLYSSVKKVAEHSFANLHDKATLASKVIKDLEGERKKMSTPKSSS 
DGQKLDKAMLDDIINEYQAVKSTADNSIESTIKEIENVLESVRRTKIESEAKNSVTSSPE 
KVFSVEDLEIYSKGRVCKGLKLNANCSRIGGKYAVSMSIKKHNVSSFENNNNQVFSEEPR 
DCFMLETTYPLVGFETSTEDGNTYAVFLTGVGLERSLPKYVPVFDMNAGIQTLNMTGLRM 
AKLPVLCMFGRTEYDNLEDFYITSIETQSFDEEENDARMRCHTEDLERKKRMNDAPAITP 
HVAVYDYSGDGKEQLLYMITEYENTASWCNANGVVTSDSGFSNECAISDMNDLCCFADCI 
DVTVNNEEHEERSMNIVVESDRRLFDASPSPIKTEEDGENSSSSSSSPTVPPPTPYEGNA 
VVEGEEEEEEIDEDESSKYEGSEDALVMKKLAKLSTMKQMRRVKNEPALKITSGGNNSSS 
SINNEDDGDDDDAVDATALCPQTEATVKNSFMAPNDERTENILYETMQISLAKICNNPSS 
MSSYRVFTNKLQECLNTMDDSIRRRPTIWTEESQQFAKGLLFDEVVTSIVAHQMAQDICK 
SEIFGGMFNANSTNIKGKYEGQKKSLYGNKHISSSCFKTNTESNVNNALFAWVKSKLHSG 
TVIPNVFSFKMASEKPSKMKRKRTSSASSSNDEHQEPSTKMMKNDEGEKVAQESSSPSSS
STPEQQQQAGHDKETINLIPLSFIKMPRSNVNGSASYLSEIFGQRLCGLSDASSTFKRMC 
KTFEDLENEIMRSSFTRLTRYEREVTRLYEKCRSQAVDIEENEMDVLSHQGELFAEFLED 
PIAYFEEVLENIKSWSLENVNTPKRKNKYAKVLVSVNAIRRTYEEYHAFSKFVPMFLFNL 
IKRELEGDNYTHDVHFSSTCLWYLTVMTRNRICDVLQYINNNNNDNEETDIVEEEEEGEG 
EEDKMEESMDVEQQKQVRKGGRKKGQKFNSIGDQVIRKFVKSLCENSMVVSIAINSLISG 
ISWMNKKIPPGFLKDSSTINTLDEVSRFVFSDVKINRKINGTDDKYETVFGVSTRVDSHI
VGPFSIPVDFSSAGLDKASCGKLYVNTIDGKGILTISPKYDSLNDEDVDSTTTDKLEKDI
LHLSKHDTFFNINKNKVLPFYNISPSSSLTEKKKTKFNRKKISSGMSNNNGMCVQTPSSS 
NSVSSVSSIVAPSSSVLALSCSLSSTKKKSIWNENMFLTSRNMWRCGFVVPPKLCSFIVN 
HRHAVKLVAETAPKTKLCRNIIDRNRKIRFNGLKKVCKSVSAFTGESTYLLNKNMTATSP 
SDLNLCIYTSSLNDPLYTCKLTHEEYQDGNALDDYGAVFVNYTFKSIKSCSSKDETADDN 
AAAADDDGSTTSTSSSTDTDAAAIQDFMHVMIKKIDAMKDIRGKYKKSLAKKTKKH
(SEQ ID NO: 241) 

Top Blast Hits 

                                             Score   E
Sequences producing significant alignments:  (bits)  Value

UBP1_YEAST (P25037) UBIQUITIN CARBOXYL-TERMINAL HYDROLASE 1  50  1e-04
O97236 (O97236) PFC0230C PROTEIN  48  3e-04
CAB43859 (CAB43859) HYPOTHETICAL 85.2 KD PROTEIN  46  0.001
NSR1_YEAST (P27476) NUCLEAR LOCALIZATION SEQUENCE BINDING P  44  0.006
Q21000 (Q21000) SIMILARITY TO C. ELEGANS MYOSIN HEAVY CHAIN  44  0.006
Q18918 (Q18918) CODED FOR BY C. ELEGANS CDNA CM11B12  44  0.006

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 1010 of CT1022: this
corresponds to nucleotides 82212 to 83222 of the genomic reference sequence.

CT1023
Nucleotide 
Genomic coordinates: 
Start: 93228 

Stop: 94137 (SEQ ID NO: 242) 
Amino Acid 

MVSSITHLSLLFVVAVVASVVFTTEGASVRVKRCAVSPCPDVIDPDHRCQGRLCRRSTRG
GDDDDDDDDGGTFDTVGSGILGRKKRAAPPPEDEEEDDFYRKKRAAPPPEDEEEDDFYRK 
KRAAPPPEDEEEDEFYRKKRAAPPPEDEEEDEFYRKKRAAPPPEDEEEDEFYRKKRAAPP 
PEDEEEDEFYRKKRAAPPPEDEEEDEFYRKKRAAPPPEDEEEDEFYRKKRAAPPPEDEEE 
DDFYRKKRAAPPPEDEEEDDFYRKKRAAPPPEDEEEDDFYRKKRAAPPPEDEEEDDFYRK 
KR 

(SEQ ID NO: 243)

Top Blast Hits

                                             Score   E
Sequences producing significant alignments:  (bits)  Value

Q18401 (Q18401) COSMID C33G8  113  1e-24
Q9YTL7 (Q9YTL7) ORF 48  98  6e-20
P73032 (P73032) HYPOTHETICAL 185.1 KD PROTEIN  95  7e-19
Q43687 (Q43687) EXTENSIN-LIKE PROTEIN (FRAGMENT)  88  7e-17
Q09085 (Q09085) EXTENSIN CLASS II PRECURSOR (CELL WALL HYDR  83  2e-15
O14686 (O14686) ALR  77  1e-13

Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 214 to 1051 of CT1023: this
corresponds to nucleotides 93435 to 94272 of the genomic reference sequence.

CT1024
Nucleotide 
Genomic coordinates: 
Start: 94623 

Stop: 95742 (SEQ ID NO: 244) 
Amino Acid 

MDNLKGEFVALKTDLTHYKTQLDRSILVFVDVVGRLYVIWSEQTAKKEGLATRVAKQAT 
EIQQFKDEINNKYNALTNTLDDIIYIFDHGGSFKRAKHKAIIEAREYSKPLRELECMFTR
IADMLTLTFMTVYTNIITEFRHSSEQATNSINVTLGRLFLCDDLCNQLPKEEEEEEDLKQ 
KFITFHANLYMLDTRLKKDL11FKDVIQQLHVILQKDTYAVKEGVAIRCAKQMNEISQYR
DNLKDNYNTFSNILNEIVYIFDHGGHFEEVKHKAITLTRNYLKTLMGLKCMFKRISEMLS
LTFLTVYTNVIAEFINASNISDREINNYLVQLVTCNELCNQLPKPKQYRPLSLIDNIAYF 
SLSVQKHLSGFL 
(SEQ ID NO: 245)

Top Blast Hits

                                             Score   E
Sequences producing significant alignments:  (bits)  Value

AAF04635 (AAF04635) HYPOTHETICAL 43.2 KD PROTEIN  747  0.0
O96219 (O96219) HYPOTHETICAL 139.4 KD PROTEIN  39  0.051
O64554 (O64554) YUP8H12R.45 PROTEIN  39  0.067
TPR_HUMAN (P12270) NUCLEOPROTEIN TPR  37  0.20
Q99968 (Q99968) NUCLEAR PORE COMPLEX-ASSOCIATED PROTEIN TPR  37  0.20
O97291 (O97291) PFC0960C PROTEIN  37  0.26

Comments:

Hit to public SBV sequences:
gi|6165655|gb|AF099142.1: CT nucleotides 1 to 1119 match nucleotides 3220 to
2102 of the public sequence with a 100% homology, a score of 2218 and an Evalue
of 0.0

TaqMan Primer/Probe Sets:

5'start=478
5'stop=499
3'start=576
3'stop=598

5'primer=TTGTGTGACGACTTGTGCAATC (residues 478 to 499 of SEQ ID NO: 244)
Tm5=58.28

3'primer=AATCTTTCTTTAGGCGTGTGTCC (residues 576 to 598 of SEQ ID NO: 244)
Tm3=57.55

probe1=TGCGAACCTATACATGCT (residues 558 to 575 of SEQ ID NO: 244)
probe1start=558
probe1Stop=575
direction1=Forward
Tm1=69.00
score1=1.99
length=121

CT1025

Nucleotide 

Genomic coordinates: 

Start: 95824 

Stop: 97369 (SEQ ID NO: 246) 



Amino Acid 

MDSCCLISRITPELAGKLTWIFIPENNFKIVQNSLPDDQVISQFRYFDHRHCYTFMEILM 

ANIKIQDRKQNTTAICELTTGREGLLCRRTIPVFLGSEEKREELLGNLPEGAEIFRFREV

MQVIGTLLDKKLEIDDGIASVKAALCAGSSSLYLIMSHIVKMTFSAITNMKDINEEYFVD 

FIFRHKQFLNPEFFKHLISLLKNSRKEHVAHLVRRLEHFI^LWTLSKMRFTEMEENYFPI 

SSDSDYGICEKCARKTPKYKLRIFRERKCCDRCCRLYHQQPPPEVYNWDGKITQQSNKGY 

INAGDEIIGMLNSNDKGKTFPPIPKMVVRRVVDGVYGQGTILSKILKFRQANIPTCLFVT 

CNKCNRIFRLTILGPTRNILCPPCRKKSVAVNTQQKGENKPSFVQKGTKRLRVDTGSNKN 

TLEKFCSWERFNTEVLLPWLGYTIESKWQNWESFLGYSSTRYKELWAFVNKQEISSMKDS 

YIKIEDIDQLLRSILQDQKGVFETVCKIKSRDGL

(SEQ ID NO: 247) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

AAF04634 (AAF04634) HYPOTHETICAL 45.2 KD PROTEIN 689 0.0

STA5_MOUSE (P42230) SIGNAL TRANSDUCER AND ACTIVATOR OF TRAN 32 7.5 

AAD20715 (AAD20715) PUTATIVE DNAJ-LIKE PROTEIN 32 7.5 

CAB60246 (CAB60246) HYPOTHETICAL 80.9 KD PROTEIN (FRAGMENT) 32 9.9 

Comments : 

Hit to public SBV sequence: 

gi|6165655|gb|AF099142.1: CT nucleotides 1 to 1545 match nucleotides 2019 to 477 
of the public sequence with a 99% homology, a score of 3031 and an Evalue of 0.0 



TaqMan Primer/Probe Sets: 

5'start=672
5'stop=693
3'start=765
3'stop=786

5'primer=CCTTTCCAAGATGAGGTTCACA (residues 672 to 693 of SEQ ID NO: 246)
Tm5=57.77

3'primer=ACGGAGCTTGTATTTGGGAGTT (residues 765 to 786 of SEQ ID NO: 246)
Tm3=57.84

probe1=TCCAGCGATAGTGATTAC (residues 721 to 738 of SEQ ID NO: 246)
probe1start=721
probe1Stop=738
direction1=Forward
Tm1=69.03
score1=1.96
length=115

CT1026

Nucleotide 

Genomic coordinates: 

Start: 97547 

Stop: 98789 (SEQ ID NO: 248) 
Amino Acid 

MESIKLFTVAGLNMEQANQVAEEIKSEYKTEEEKRIAQEVFDKFTKKLIMQVDTSKHLLT 

RENPNRFVSRPIVHEDLWEMYKKEVACFWTLEEIDFERDPKDWEKLTQDEKDFILQILAF 

FASSDGIVIENLTTRLRQVAQIPEARSFFDFQVGMESIHGNVYGELIDRLVPDEKDKAIL

FNAAQHFPAIKKKEQWAINWMQSNNDLAELIVAFAAVEGIFFSGAFASIFWIKNRGILPG 

LTSSNEFISRDEGLHRDFACMLLKKGFVDTPSRERILEIVTEAVRIEQEFLTVSLPVKLV 

GMNCKLMSQYIEFVADKLLVEMGLEKHYNVTNPFPFMDNISLENKTNFFEKRVAEYQRAQ 

VMASINKIKKDQQTQETGSPLPILTAPPPVSSSSSEQEDVEDGVGDYISYDDF 

(SEQ ID NO: 249) 

Top Blast Hits

                                             Score   E
Sequences producing significant alignments:  (bits)  Value

Q9XYN8 (Q9XYN8) RIBONUCLEOTIDE REDUCTASE R2 SUBUNIT  388  e-107
RIR2_HUMAN (P31350) RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE M2  381  e-105
RIR2_MESAU (Q60561) RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE M2  381  e-105
RIR2_MOUSE (P11157) RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE M2  378  e-104
Q27124 (Q27124) RIBONUCLEOTIDE REDUCTASE SMALL SUBUNIT  377  e-104
RIR2_BRARE (P79733) RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE M2  376  e-103

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 959 of CT1026: this 
corresponds to nucleotides 97786 to 98744 of the genomic reference sequence. 

Hits to public SBV sequences: 

gi|7672988|gb|AF144620.1: CT nucleotides 1 to 1242 match nucleotides 1 to 1242 
of the public sequence with a 99% homology, a score of 2454 and an Evalue of 
0.0; 

gi|6165655|gb|AF099142.1: CT nucleotides 1 to 299 match nucleotides 299 to 1 of
the public sequence with a 99% homology, a score of 585 and an Evalue of 1e-169

TaqMan Primer/Probe Sets:

5'start=642
5'stop=663
3'start=706
3'stop=727

5'primer=TGCTGCAGTTGAAGGAATCTTC (residues 642 to 663 of SEQ ID NO: 248)
Tm5=58.19

3'primer=AGGTGAGACCAGGCAAAATACC (residues 706 to 727 of SEQ ID NO: 248)
Tm3=58.13

probe1=TTAGTGGTGCATTCGCAT (residues 665 to 682 of SEQ ID NO: 248)
probe1start=665
probe1Stop=682
direction1=Forward
Tm1=68.98
score1=1.98
length=86
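
The oligo sequences can also be compared against the amino acid sequence given for the same entry. The Python sketch below rests on two assumptions that the listing does not state: that residue numbering in SEQ ID NO: 248 starts at the first base of the start codon, and that the 3' primer is written as the reverse complement of the coding strand. It uses the standard genetic code; the helper names and the protein fragment variable are illustrative, not part of the source.

```python
# Standard genetic code, indexed in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq: str) -> str:
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def in_frame_peptide(sense_seq: str, start: int) -> str:
    """Translate a coding-strand segment whose first base sits at 1-based
    position `start`, skipping bases until the next codon boundary
    (assumes position 1 is the first base of the start codon)."""
    skip = (-(start - 1)) % 3
    usable = sense_seq[skip:]
    usable = usable[: len(usable) - len(usable) % 3]  # drop a trailing partial codon
    return "".join(CODON_TABLE[usable[i:i + 3]] for i in range(0, len(usable), 3))


# Values transcribed from the CT1026 set above (SEQ ID NO: 248).
primer5 = "TGCTGCAGTTGAAGGAATCTTC"   # residues 642 to 663
primer3 = "AGGTGAGACCAGGCAAAATACC"   # residues 706 to 727
probe1 = "TTAGTGGTGCATTCGCAT"        # residues 665 to 682

# Fragment of the CT1026 amino acid sequence (SEQ ID NO: 249) spanning the amplicon.
protein_fragment = "AAVEGIFFSGAFASIFWIKNRGILPGLTSS"

for name, sense, start in [
    ("5' primer", primer5, 642),
    ("probe", probe1, 665),
    ("3' primer", reverse_complement(primer3), 706),
]:
    peptide = in_frame_peptide(sense, start)
    print(name, peptide, peptide in protein_fragment)
```

If the assumptions hold, each translated peptide appears in the protein fragment, which supports both the oligo transcription and the orientation of the 3' primer.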
CT1027
Nucleotide 

Genomic coordinates: 
Start: 98874

Stop: 99789 (SEQ ID NO: 250) 
Amino Acid 

MNLLPIFLTTFFVAVDACSCSTICLLPDGKKQPLVFDSVLEEVVYPTDVCGPKGAGELFT 
GVDLLTLCIGGKNNGGEWSGKGPCPRINNAVVERDYSLDEEDCKGFRKGFRIPGTDHFHT 
VFSLCWVDRDMHAKWVRNKINPGIVTDDEDLVDSGIRTKFKYSSKIFGKGFNPRPLYSLD 
YQERIKILKSHFNKRTGNFFARGHLAPAGDFFLASERWATFALENAVPQIQNHNNGEWKD 
IENRARTTPGAAWAETGPIFYQHKKKEYLDKKKKYIPIPHALYKIVYDKNNKELFRVQSD
MSWK 

(SEQ ID NO: 251) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

CAB55635 (CAB55635) DEOXYRIBONUCLEASE I PRECURSOR (EC 3.1.2 59 6e-08 

Q9Y2C4 (Q9Y2C4) ENGL-A 48 7e-05 

NUCG_MOUSE (O08600) ENDONUCLEASE G PRECURSOR (EC 3.1.30.-) 45 0.001

NUCG_BOVIN (P38447) ENDONUCLEASE G PRECURSOR (EC 3.1.30.-) 45 0.001 

O73911 (O73911) K123 PROTEIN PRECURSOR 43 0.004

NUCG_HUMAN (Q14249) ENDONUCLEASE G PRECURSOR (EC 3.1.30.-) 42 0.005 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 808 of CT1027: this
corresponds to nucleotides 98914 to 99720 of the genomic reference sequence.

CT1028

Nucleotide 

Genomic coordinates: 

Start: 103891 

Stop: 104680 (SEQ ID NO: 252) 



Amino Acid 

MSLAVTEDYGHNEKLIKRLQTSVYHTPLLGADHVMKSISDY1ISRRFMNYTNLLKQVEYV

FDEETGAVIANICLLKILERCAQKGGIYDAPEDVAFFNSKMGEVTRLFTIIGGRPNMTVR 

VNFKHGQTNNPAYGYLTDDNDTTTVTPPVTPPPSPAARRSPFFTRTLISESSSVDHYVLM 

HDNPKRSSFKVYDIHAETFPHKAPSVPTFPPKTSFEISDVTLDCSMEIFSRDRDVLDNVH 

DYIANDPVPFLVDVVHRGSSLR

(SEQ ID NO: 253) 



Top Blast Hits

                                             Score   E
Sequences producing significant alignments:  (bits)  Value

PAXI_CHICK (P49024) PAXILLIN  36  0.39
Q24459 (Q24459) POLYCOMBLIKE NUCLEAR PROTEIN  35  0.67
LA17_YEAST (Q12446) PROLINE-RICH PROTEIN LAS17  34  1.5
Q14676 (Q14676) KIAA0170 PROTEIN  33  2.6
Q94399 (Q94399) ZK265.2 PROTEIN  32  3.4
O88699 (O88699) HOMEODOMAIN PROTEIN  32  3.4



Comments : 



TaqMan Primer/Probe Sets: 

5'start=390
5'stop=413
3'start=457
3'stop=477

5'primer=TCCTGCCTATGGTTATCTCACAGA (residues 390 to 413 of SEQ ID NO: 252)
Tm5=58.77

3'primer=TCTTCTTGCAGCTGGAGATGG (residues 457 to 477 of SEQ ID NO: 252)
Tm3=59.15

probe1=CCTCCTGTTACTCCTCCT (residues 439 to 456 of SEQ ID NO: 252)
probe1start=439
probe1Stop=456
direction1=Forward
Tm1=68.94
score1=1.94
length=88

CT1029

Nucleotide 

Genomic coordinates: 

Start: 104759 

Stop: 107330 (SEQ ID NO: 254) 



Amino Acid 

MASTSSSTKKRVHEEDENLIPQPKKKKSKKVLPFPVDKYRAVDKKVVNLIHKILDQEKDH 

LSSTELQMITECNGAREDLLKHLLDEGEFNPTIIEVVSSMPIETIYEILSSSADDKKFVQ

ISLSMLIHILFFADKGTMWVSNACVQNVLGNDYKVEFENIRKKYLILEDLLNGVSNHWSE 

HGPLSHMLHSSIPIVQDMLLNRLVRYFSTYDGDAQFDISFIINSVLWGIDKSVLNELTQL 

ISRGVFIVSYVPMRVRTPSKDSNRPQNTPSQNMSALGMKLNTFSSRISVYRNNTFKKLTE 

LVHNFDYGSKDASSSSPPPPSLSDSVNTFVRLYTNYDIFLKVISDWKMPYGFFKKTFDVL 

YSKGLMTLSVSEYTLKKELVTFLRALKEREILIYKMEKRDIICILKKSLFGFNFRCLKQL 

LPLFKHFLKIEEVKHIARFVFRDYSLMCKTQKDLQSFPAIQSASLFMEEFPWLAKTWIDD

DDDEGGKGHTLLTFAIVHRYPLISQLISHPILKSLVNTTCRDKHFTPLMHLANTSIMYQC

NTLLCLIINGAKPEFINKFNENVLHIAIENVNYGVITELRGTLSSEQIEKMVNVRRMMDN 

TTPLMIALARENIVLAQLFDGLYKPKIKVRFGSSKRLRIPEFVLLKGLKESVAYLETRNI 

SYDINIIKDAVMDNSLFEEEYEIAAAGLRGNNCDPEADEKTMNTWNFFTKNSTKWASSIF

QKNRQKFVKIVDGMNRTYEDSECAICLDSLDGDLPSGRTTCGHCFHNVCWLSLIRMSGPN 

NGSRARGGGIKCPSCRQVTCLGKRLGVADYDIETEEERDTKNVVPSVEEGRREWRKIGVD 

RYEFLVGGVWTNEIKL 

(SEQ ID NO: 255) 

Top Blast Hits 

                                             Score   E
Sequences producing significant alignments:  (bits)  Value

Q9Y577 (Q9Y577) RING FINGER PROTEIN TERF  43  0.007
AAD27914 (AAD27914) T23015.13 PROTEIN  40  0.058
AAD56722 (AAD56722) AUTOCRINE MOTILITY FACTOR RECEPTOR  40  0.076
Q9WV59 (Q9WV59) RING FINGER PROTEIN TERF  39  0.13
O82239 (O82239) F17A22.9 PROTEIN  39  0.13
RO52_MOUSE (Q62191) 52 KD RO PROTEIN (SJOGREN SYNDROME TYPE  38  0.22



Comments : 



TaqMan Primer/Probe Sets: 

5'start=1406
5'stop=1425
3'start=1491
3'stop=1514

5'primer=AATTCCCTTGGCTTGCAAAA (residues 1406 to 1425 of SEQ ID NO: 254)
Tm5=58.29

3'primer=TGGCTTATTAAGGGATATCTGTGC (residues 1491 to 1514 of SEQ ID NO: 254)
Tm3=57.50

probe1=TTGGATCGACGACGATGA (residues 1428 to 1445 of SEQ ID NO: 254)
probe1start=1428
probe1Stop=1445
direction1=Forward
Tm1=68.95
score1=1.95
length=109

CT620

Nucleotide 

Genomic coordinates: 

Start: 172439 

Stop: 171509 (SEQ ID NO: 256) 



Amino Acid 

MDNLITNDNIILVTFLSGLAVGCSMTIGLALAMNMLVKCIDRTTTCISCSPWEKNKNKKN 
RNGSNTESSFISHVRFNTPDKDLDISEPMLKSTTYDLANVTPQVTKLVTFSGPTYASPPT 
PRPVANTPQQQPTSTNKEEESVYMPMSSCSSSFSSDNSLPLPTPPPSPPRSNGGDYVSYV 
NGRHLKLPSNPPSPIFNIKNEEGEDDNVEEHVYEYVPEVPQQSPSIQKCIQELKEMKHKK 
NTLTRSSSNNNNNAPRITQVTFKKFPPNNNNMWENHVYGNTTIVSSTPSPTFIPSPKSII 
RKLSFKRKQ 
(SEQ ID NO: 257) 



Top Blast Hits

                                             Score   E
Sequences producing significant alignments:  (bits)  Value

Q24035 (Q24035) ENA POLYPEPTIDE  46  4e-04
Q41805 (Q41805) EXTENSIN-LIKE PROTEIN PRECURSOR  43  0.002
Q40786 (Q40786) ARABINOGALACTAN-PROTEIN PRECURSOR  43  0.003
Q9XZU9 (Q9X2U9) LARGEST SUBUNIT OF THE RNA POLYMERASE II CO  42  0.005
O36428 (O36428) GLYCOPROTEIN PRECURSOR  42  0.005
YHC4_YEAST (P38741) HYPOTHETICAL 80.1 KD PROTEIN IN SNF6-SP  42  0.005



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 665 to 1 of CT620: this
corresponds to nucleotides 171494 to 172158 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=494
5'stop=513
3'start=558
3'stop=578

5'primer=CGCCATCTCCACCTAGAAGC (residues 494 to 513 of SEQ ID NO: 256)
Tm5=58.49

3'primer=GAAGGTGGGTTTGAAGGAAGC (residues 558 to 578 of SEQ ID NO: 256)
Tm3=58.68

probe1=GGCGGTGATTACGTGTCA (residues 517 to 534 of SEQ ID NO: 256)
probe1start=517
probe1Stop=534
direction1=Reverse
Tm1=69.00
score1=1.99
length=85

CT621

Nucleotide 

Genomic coordinates: 

Start: 173054 

Stop: 172505 (SEQ ID NO: 258) 
Amino Acid 

MGESIFDAVSLATNNPKKSNSRNKKLLRELKNMRKDFPSTFLQCRMIDFHFSGDIIDKHY
CHSVNVPDVVPNTIFAVFLPEEDRANNPGLYDSIEGVCITVEQGELCIINKSSVHEFNIL
VSLHKDLFGEDILDGIETASREESRSIHLYLEAGQSIRTPIPRPEGTNTVNYTIVFSNQV
TV 

(SEQ ID NO: 259) 
Top Blast Hits 

                                             Score   E
Sequences producing significant alignments:  (bits)  Value

O88632 (O88632) SEMAPHORIN IV ISOFORM B  34  0.54
O88633 (O88633) SEMAPHORIN IV ISOFORM A  34  0.54
O29244 (O29244) ABC TRANSPORTER, ATP-BINDING PROTEIN  34  0.71
O54948 (O54948) SEMAPHORIN IV HOMOLOG (FRAGMENT)  33  1.6
Q13275 (Q13275) SEMAPHORIN IV PRECURSOR (SEMAPHORIN III/F)  32  2.7
INA9_MOUSE (P09235) INTERFERON ALPHA-9 PRECURSOR  32  3.6

Comments : 



FIGURE 3, Sheet 139 of 160 



WO 01/38351 



181/201 



PCT/US00/28888 



CT622 

Nucleotide 

Genomic coordinates: 
Start: 183817 

Stop: 180274 (SEQ ID NO: 260) 



Amino Acid 

MEAASNLRITEGAGVLDIDNEDDINNNVDYSNLYEDEEDEEEMNEDEEEEEEEDYEDEDE 

DTGVRNGRNKDPPSSKKQSKFVRDVTNDMYDDDDEEEEEEEEEEDEEGEEGGEYDGNLED 

EEEEGDEYEDDNEGEGEEDEADPALLALAAQQEDATIIPENQWKSIVNTPSPVGPNRQVL

PMLNFLLENVNAMGGSAGEEQKNKEDDNQQIEPVEEEEDEEEEEQEEEEEEEEEQEEEEE 

EKEPIEQEKNEPEKDEDAIENESVHSHRVESSPMSEGGNDDGMDYFFSSIAGGGNDNEED 

EEEDEEEGEEEEEEEPAQKSEEHVETKESVQSHTEYIEEEEEYEEYEDESRHTLEDEEIS 

TMHQFNNAPRVRRSPPPDIQECEDAVVFPPIMKETDILPQIKEPSPKAPRMFSILGSGGE 

EQYDQLNDIAPPPVPSIVTFPPDNEMGEESRDIMDQDSMLMPPPPPPPPPPPHQPPQLKP 

TNILLPPPPPPPTNQSLFSNNNNNPSFLSTVVGKVNNTLGGKEAEERLHKTMESIILKTR 

VKTLLETTKNLQCSELVKVVFQDPENPVKPSEKVMERLKNIIAAELTMKAFLDSAAVTDI 

KSAELFRKTNEKLELFQRKQIMSNPLFSAAYASTYIMGERASKIRPSTPAPSLKKVESIS 

ELNEDETSMSSSAGGVCAEGDESIAGGGGGGGGGGGEVVEHSSFYSNQTQANLHMELINI 

LKEDDDNQPCQTYKLGQRLAFLNNLISFKTSSAVSWSRLVNMLSDIVTKASVALFGDTNK 

AQEDFEKHQTETNDVSDLSTSSKLKQMSKESANIMEEMGLGSIGAEICFGAISTIIEKHI 

NKLCMDVGRLTIFLNIPIVLLNWPKEFTLSKDYKVLLLDSISSCSSKMAVPPIYVLNSIQ 

FDKAVDEEDEDGNGSEAEKRSEDGNMFSEKDKKEAIRRVYDNIRYGDSNDRTSLNHFFGD 

AYSGVSNNNSKNSMFDLQTQGGGRFGVAYSAGSSIIEHRSPIFDNALNTLVNFMDKRKHL

LSAVVIKLLKKAKLSIEVYCIKYKLNQASEKYNKKGKHGKSTSVVPMRNLMYRPSKNQDV 

SPSTPAAATAMDVPSSVSSHVGRKRTFSFSNDINSNMSSASSVYIDQESSTPSRRRTFMD

LLNNKSSVNSLAKQVKRMKHTKYYNSSSNSEDDDEDDQYE 

(SEQ ID NO: 261) 



Top Blast Hits 

                                             Score   E
Sequences producing significant alignments:  (bits)  Value

Q9YTL7 (Q9YTL7) ORF 48  154  3e-36
O40947 (O40947) ORF 73  148  1e-34
Q98148 (Q98148) ORF73 HOMOLOG  138  3e-31
AAD46501 (AAD46501) LATENT NUCLEAR ANTIGEN  136  1e-30
VG48_HSVSA (Q01033) HYPOTHETICAL GENE 48 PROTEIN  133  6e-30
Q91255 (Q91255) NF-180  130  5e-29



Comments : 



TaqMan Primer/Probe Sets:

5'start=1940
5'stop=1962
3'start=2039
3'stop=2058

5'primer=CGACTCCTGCTCCTTCTCTTAAA (residues 1940 to 1962 of SEQ ID NO: 260)
Tm5=57.98

3'primer=TCCAGCAATAGACTCGTCGC (residues 2039 to 2058 of SEQ ID NO: 260)
Tm3=57.66

probe1=GTGGAGTATGTGCTGAAG (residues 2021 to 2038 of SEQ ID NO: 260)
probe1start=2021
probe1Stop=2038
direction1=Reverse
Tm1=69.00
score1=1.99
length=119

CT623

Nucleotide 
Genomic coordinates: 
Start: 196292 

Stop: 195506 (SEQ ID NO: 262) 
Amino Acid 

MSNGATISDERLILILDKIVERRGVSNLSELLIHPITKHINELLKNTVRHGDRVYMKDAE 
LDVRSRLEDIKKDCVLKAIEKQGIDVRQIITDYLAKRKLTQNLVHWYRPPISCTDIDEKI
QQETGQVGRCSVATYNLRIGGDDGEFTRYDFSIPLGDFKITAKLFRSINDEDVDAVILVS
RSDVVNDVLSFEAFNRTGERVVIFFNVIVEGKSKDIDIVCKSRYKHTHILNGESATYAVK 
RIKRGDTRDDILFAITAFKEE 
(SEQ ID NO: 263) 

Top Blast Hits

                                             Score   E
Sequences producing significant alignments:  (bits)  Value

PYC1_YEAST (P11154) PYRUVATE CARBOXYLASE 1 (EC 6.4.1.1) (PY  33  1.9
BAA85008 (BAA85008) ORF3P  33  2.5
BAA85073 (BAA85073) ORF4S  33  2.5
PYC2_YEAST (P32327) PYRUVATE CARBOXYLASE 2 (EC 6.4.1.1) (PY  32  5.7
O27375 (O27375) ADENINE PHOSPHORIBOSYLTRANSFERASE  32  5.7
Q88444 (Q88444) GLYCOPROTEIN PRECURSOR  31  7.5

Comments:

TaqMan Primer/Probe Sets:

5'start=282
5'stop=304
3'start=383
3'stop=402

5'primer=GGCTAAACGAAAACTAACGCAAA (residues 282 to 304 of SEQ ID NO: 262)
Tm5=58.58

3'primer=CGTAGCAACACTACACCGCC (residues 383 to 402 of SEQ ID NO: 262)
Tm3=57.92

probe1=CCAATATCTTGCACAGAT (residues 328 to 345 of SEQ ID NO: 262)
probe1start=328
probe1Stop=345
direction1=Reverse
Tm1=68.97
score1=1.88
length=121

CT624

Nucleotide 

Genomic coordinates: 

Start: 195503 

Stop: 194651 (SEQ ID NO: 264) 
Amino Acid 

MSSNGDEPAVTEAEIASVMQLGAAHHDNSWITRKSDQLKYRLGAIAYSVAKNASIKYIE 
DQVRQEINSHLTNVMTFEHLYEDAFNPVICEAIFEKGIPVVMEKVYDVNRRIMEPREDFI
TEILKEERWRRYIPGFYHTSFSFKYNTIAFTDSSTSFSVPINDKHMLSITPPGAAQGDLI 
DLSLSFKIDSSAKTLTLEFNRKSTFAGIVNRPKSWILSNLRNSDSSDNIGDYLKRNDPI 
YISHDTNGIINPSEDSASLITIHMPEIENASDDLYIDFNLFVF 
(SEQ ID NO: 265) 

Top Blast Hits

                                             Score   E
Sequences producing significant alignments:  (bits)  Value

Y687_METJA (Q58100) HYPOTHETICAL PROTEIN MJ0687  36  0.42
Q9Y717 (Q9Y717) CAGCR3 PROTEIN  33  2.8
Q83457 (Q83457) FIBRE  32  4.7
CAB53329 (CAB53329) PUTATIVE FORMYLTRANSFERASE  32  4.7
PGK_LACDE (O32756) PHOSPHOGLYCERATE KINASE (EC 2.7.2.3)  32  6.2
Q9XEQ1 (Q9XEQ1) TNP2-LIKE PROTEIN  32  6.2

Comments:

CT625

Nucleotide 

Genomic coordinates: 

Start: 194629 

Stop: 193327 (SEQ ID NO: 266) 



Amino Acid 

MALSNNGGIYIVFAVIVLVIGASIALFFAISGVGKGTLHSNAKTKKSKKYKLDSKYTDDD 
EKTDDDNNNNGGGGGGTVDVINETALQRQTREHFARTLEKAEDEFFTKLADQEFDTYKSE 
NVWLIKDKITDGKVSIPEGDINVPDVGQAIADENLFDLIGTNHDEVKETMDEVVAQKSTN 
ITYEQLVIDLTNILLFGTVTVDPSDENGDESLQRSTDPDAEMVMLTTTPSSQLARQQQPP 
QPTPDYLARYSKELVINNIRGGFISDRDMRTWQGRMSVHVNMKQRTFNVISAATNLDSLQ 
VGLEPVLQKQGRAAVGGRIEKARIEFSFVVEGNRVRVYATNKTEDCFCSLLPNCYNVKKA 
SDYWISSASTAKEKTYLFIANKNDETSFFYNFEEGVEEIDLDIFMTIDCAPNLPFIKNLP
RPITDNNIMVALS 
(SEQ ID NO: 267) 

Top Blast Hits 

                                             Score   E
Sequences producing significant alignments:  (bits)  Value

O96148 (O96148) HYPOTHETICAL 57.8 KD PROTEIN  38  0.14
TRI1_ECOLI (P14565) TRAI PROTEIN (DNA HELICASE I) (EC 3.6.1  37  0.31
Q43423 (Q43423) CYSTEINE PROTEINASE (FRAGMENT)  36  0.40
O15696 (O15696) ORNITHINE DECARBOXYLASE  36  0.69
MIX2_XENLA (Q91685) HOMEOBOX PROTEIN MIX.2  36  0.69
BAA84604 (BAA84604) MYOM PROTEIN  36  0.69



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 351 to 1 of CT625: this
corresponds to nucleotides 193270 to 193620 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=651
5'stop=669
3'start=729
3'stop=748

5'primer=CCCAGACGCAGAAATGGTG (residues 651 to 669 of SEQ ID NO: 266)
Tm5=58.66

3'primer=ACCGGGCAAGGTAATCAGGT (residues 729 to 748 of SEQ ID NO: 266)
Tm3=59.29

probe1=ACAACAACAACCTCCTCA (residues 705 to 722 of SEQ ID NO: 266)
probe1start=705
probe1Stop=722
direction1=Forward
Tm1=69.01
score1=1.98
length=98

CT626

Nucleotide 

Genomic coordinates: 

Start: 228196 

Stop: 227989 (SEQ ID NO: 268) 



Amino Acid 

MSDMTRNIIVGLAVVVIALSMVAFMLSVTPALTGFLLGLGVSALGVTLFGCPTMKSPGGG 
NATINPVA 
(SEQ ID NO: 269) 

Top Blast Hits

                                             Score   E
Sequences producing significant alignments:  (bits)  Value

SUGE_PROVU (P20928) SUGE PROTEIN HOMOLOG  32  0.35
Q48546 (Q48546) INTEGRAL OUTER MEMBRANE PROTEIN  32  0.35
O86349 (O86349) HYPOTHETICAL 27.4 KD PROTEIN  31  1.0
Q9Y9X7 (Q9Y9X7) 604AA LONG HYPOTHETICAL CARBON STARVATION P  30  1.8
O69867 (O69867) PUTATIVE TRANSMEMBRANE TRANSPORT PROTEIN  29  4.0
O26370 (O26370) HYPOTHETICAL 5.9 KD PROTEIN  29  4.0



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1032 to 768 of CT626: this
corresponds to nucleotides 227951 to 228215 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=60
5'stop=80
3'start=151
3'stop=173

5'primer=CATGGTCGCTTTCATGCTTTC (residues 60 to 80 of SEQ ID NO: 268)
Tm5=59.30

3'primer=CCTGGAGATTTCATAGTGGGACA (residues 151 to 173 of SEQ ID NO: 268)
Tm3=58.94

probe1=TGTTACTCCTGCACTTAC (residues 81 to 98 of SEQ ID NO: 268)
probe1start=81
probe1Stop=98
direction1=Forward
Tm1=68.95
score1=1.95
length=114

CT627

Nucleotide 

Genomic coordinates: 

Start: 234330 

Stop: 233778 (SEQ ID NO: 270) 
Amino Acid 

MFQKWFESFLDSSRPRYLDTTCVCSVYSYFSPCRKHIKFSTSHSHEGIKIHPPSILNHNT 
SSPTSGKMCNHHHKRLYLSTDDHTRWYDKNTSCIYLEDIGGVQFMVYEFHLTPKNNQLFS 
FPVHLQIHNRNTEKTSLLVFENEEDMRVRNIHPKSKILIPVSKDTVLVENGFRYKVKIVL 
SNK 

(SEQ ID NO: 271) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

TRBH_ECOLI (P19381) TRBH PROTEIN 31 6.4

BCL6_HUMAN (P41182) B-CELL LYMPHOMA 6 PROTEIN (BCL-6) (ZINC 31 6.4 

Comments : 

TaqMan Primer/Probe Sets: 

5'start=187
5'stop=207
3'start=233
3'stop=253

5'primer=CCCACCAGTGGAAAGATGTGT (residues 187 to 207 of SEQ ID NO: 270)
Tm5=58.36

3'primer=TCGTATGGTCGTCAGTGCTCA (residues 233 to 253 of SEQ ID NO: 270)
Tm3=58.89

probe1=CACCACAAGAGATTGTAC (residues 214 to 231 of SEQ ID NO: 270)
probe1start=214
probe1Stop=231
direction1=Forward
Tm1=68.98
score1=1.98
length=67

CT628

Nucleotide 

Genomic coordinates: 

Start: 240139 

Stop: 239455 (SEQ ID NO: 272) 



Amino Acid 

MDSLISKLENIFSIAEQDFFNADSMFMQTMLLPTDAMFTDCESPLYKNKSGGKNIVTDVG

ESVLSSSSDEKMSFKVLSHVLRRFPVLLHCNYKQTNTPLWKELYKHGKFALLGDLVLFSN 

PFHPNIPAMPFDKSPICDTTGKSIIMSEVMTKELLYKIADKDIGQFFAVLNVTNPITGDS 

FLHYFAGGNTMRDGEGDKICTSADVLRIIAEITIQKTGKMPYELMKK 

(SEQ ID NO: 273) 

Top Blast Hits 

                                             Score   E
Sequences producing significant alignments:  (bits)  Value

SYT_HAEIN (P43014) THREONYL-TRNA SYNTHETASE (EC 6.1.1.3) (T  39  0.028
SYT_ECOLI (P00955) THREONYL-TRNA SYNTHETASE (EC 6.1.1.3) (T  38  0.049
CDR1_SCHPO (P07334) MITOSIS INDUCER PROTEIN KINASE CDR1 (EC  35  0.55
Q55626 (Q55626) METHYLENETETRAHYDROFOLATE DEHYDROGENASE  32  3.7
ERG2_SCHPO (P87113) PROBABLE C-8 STEROL ISOMERASE (DELTA-8-  32  4.8
Q85056 (Q85056) SEGMENT 2  31  8.3

Comments : 



TaqMan Primer/Probe Sets: 

5'start=246
5'stop=267
3'start=288
3'stop=305

5'primer=GCGATTCCCTGTCCTACTTCAT (residues 246 to 267 of SEQ ID NO: 272)
Tm5=58.22

3'primer=TCCTTCCACAGGGGCGTA (residues 288 to 305 of SEQ ID NO: 272)
Tm3=58.60

probe1=CAACTACAAGCAGACGAA (residues 270 to 287 of SEQ ID NO: 272)
probe1start=270
probe1Stop=287
direction1=Forward
Tm1=69.05
score1=1.94
length=60

CT629

Nucleotide 

Genomic coordinates: 

Start: 247143 

Stop: 244950 (SEQ ID NO: 274) 



Amino Acid 

MEGGDQRTKLTPATVMGLYQSKTPGEGEGGEGGGQFKIPSAIAVKSCCSKNATRRSPPSD 

SPYSLRPMKRLKKNNGEVGGKAPPPVTLRLREDYESTPYNFNRNKKKRPITIDENQFATL 

NPTYATDIIKKQQLPSVSAASVLRKHRANADTQYRKRFSHPNCAKFSTVNLKARDYTPLS 

VLRSHVKGPKHLKSSCDTVTETNWKRNFSSIDKWVKLEKPPCYFAVAEADTNIAAGLES

PFHLIRQAAKLGLISDVQDVSSNYETIKQSCIDAKEKASKFLWSNNRTKQPPSSWWPVGF 

GSKNLSVLDTSPLLNWNRLCKNNGKGWIKTMSIDHMAKNVFKLSPGACESILEKKTTLLG 

EVTAQCKKWESYRRNIPVPAHVQPEYASQVVMIGPSELYLEVKVGVYYMLETGKVIKFMT 

DKEMYCEFVFETVFSHALEGRMKGAVGVRKMCVEGFCVEMDFAGISVIDVLNGDLKCKMD 

ENVVQQPNPSTTSSKPAAELMQDHGSLCRMRDTLYGVRMLQATGRLPEGLQSKCKKPITD 

SISAIAIVGKMRERMLNQLPFVLVEIVNIVTRLSQQGLVNPDIKSDNIVIDGITGQPKMI

DFGLIVPCKKYYNFKCWGTDERFFSNHPHTAPEFINSELCSETAMTFGLAYLLIDMLSIL 

IKRTADLSANSIYTNIPFLSIVSKMYDQEKTNRPRAYEIAPVIGACFPFKDNIAKLFQSP

KHSLYSKKVK 

(SEQ ID NO: 275) 

Top Blast Hits 

                                             Score   E
Sequences producing significant alignments:  (bits)  Value

Q20443 (Q20443) PRK-2 PROTEIN  46  0.001
Q17735 (Q17735) SIMILAR TO PROTEIN KINASE  44  0.004
KPC1_LYTPI (Q25378) PROTEIN KINASE C (EC 2.7.1.-)  43  0.007
O01715 (O01715) PROTEIN KINASE C  43  0.010
O01716 (O01716) PROTEIN KINASE C (FRAGMENT)  43  0.010
Q21734 (Q21734) PUTATIVE RIBOSOMAL PROTEIN S6 KINASE II ALP  42  0.017



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 933 of CT629: this
corresponds to nucleotides 244994 to 245926 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=933
5'stop=954
3'start=1020
3'stop=1040

5'primer=CCCTCTCTTGAACTGGAACAGG (residues 933 to 954 of SEQ ID NO: 274)
Tm5=58.83

3'primer=GCTCCAGGGGAAAGCTTAAAA (residues 1020 to 1040 of SEQ ID NO: 274)
Tm3=58.49

probe1=TGAGCATCGATCACATGG (residues 992 to 1009 of SEQ ID NO: 274)
probe1start=992
probe1Stop=1009
direction1=Forward
Tm1=68.98
score1=1.98
length=108

CT1030

Nucleotide 

Genomic coordinates: 

Start: 108549 

Stop: 109164 (SEQ ID NO: 276) 
Amino Acid 

MSSGKVTYEIVEGGLLNNKYLLDGGAAICLQSNCVARKRHAGSLHDNLFKMLGFGDPYKQ 
RRGKTNSKNLAIIEDRPQLGSVSVVQHPTEPERFCSMTFLFAQYNMGNGRKCYFPNDKEY 
VESCKKHERVHKSSTEMKRLRLYYFNKCLHAIAKSPAMKKYNKIIFPARIGCAAAGGDWE
KYHASIRDFSTIIDKEVIIVSQRM
(SEQ ID NO: 277)

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9YTM9 (Q9YTM9) ORF 36 32 4.3 

Q9XAQ7 (Q9XAQ7) NUOD, NADH DEHYDROGENASE SUBUNIT 31 9.7 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 604 of CT1030: this
corresponds to nucleotides 108546 to 109149 of the genomic reference sequence.

TaqMan Primer/Probe Sets: 

5'start=284
5'stop=307
3'start=387
3'stop=408

5'primer=GCTCCATGACATTCTTATTTGCTC (residues 284 to 307 of SEQ ID NO: 276)
Tm5=57.98

3'primer=TTCTGTGGAAGATTTGTGGACC (residues 387 to 408 of SEQ ID NO: 276)
Tm3=57.62

probe1=TGTTGAGAGCTGCAAGAA (residues 360 to 377 of SEQ ID NO: 276)
probe1start=360
probe1Stop=377
direction1=Forward
Tm1=69.08
score1=1.91
length=125

CT1031

Nucleotide 

Genomic coordinates:

Start: 109260 

Stop: 110088 (SEQ ID NO: 278) 
Amino Acid 

MSSNRFSQLRGNEEMVGDYSRWTTVKNRRNRQQQYSHSFRPQQQQQHQKRTSTNSPPAPP 

PPFPIISWGALGSYSMYRLDDQCRNCDETGYYNFHSYDRKRERVRSLNNTPSEGMWRRTS 

RSSPFLNKKKDVDEAPPPQSNQHMYPLNKYSFREYTPSSKLVNWRDPSQEKQDKILQEEE 

ARAPTPTPQEKEPEVETKDDVVIEEETAPEPEPEPAPVPDPDIPAITATTTTTTVATRHD 

DSSTVFLRNVILSIVFWFLGVYSALFAKCIRSKKE 

(SEQ ID NO: 279) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q01823 (Q01823) ORF-3 48 5e-05 

PAR1_TRYBB (P08469) PROCYCLIC FORM SPECIFIC POLYPEPTIDE B1- 48 7e-05

Q99356 (Q99356) PROCYCLIN 48 9e-05 

P73032 (P73032) HYPOTHETICAL 185.1 KD PROTEIN 48 9e-05 

PARC_TRYBB (Q06084) PROCYCLIC FORM SPECIFIC POLYPEPTIDE B-A 48 9e-05 

Q27045 (Q27045) SCHIZONT/SPOROZOITE SURFACE PROTEIN (FRAGME 48 9e-05

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 137 to 680 of CT1031: this 
corresponds to nucleotides 109401 to 109944 of the genomic reference sequence. 

TaqMan Primer/Probe Sets:

5'start=255
5'stop=277
3'start=347
3'stop=364

5'primer=TTGCGATGAAACTGGCTATTACA (residues 255 to 277 of SEQ ID NO: 278)
Tm5=58.44

3'primer=ATCTACTTGTGCGCCGCC (residues 347 to 364 of SEQ ID NO: 278)
Tm3=58.42

probe1=CTCCAAGTGAAGGCATGT (residues 329 to 346 of SEQ ID NO: 278)
probe1start=329
probe1Stop=346
direction1=Forward
Tm1=68.97
score1=1.97
length=110

CT1032
Nucleotide 

Genomic coordinates: 
Start: 119056 

Stop: 121081 (SEQ ID NO: 280) 
Amino Acid 

MAGNRTQFVSSLIAKCISDVEQGMECCGRQAQDALMTRLANLKLGDSLKETDVNLEYLRY 
ASTPLLGELNYDKQQYAATVDINLMAHFSYAALGIESILNSIRRWVANHQRRNNGKKPS 
EPISRPHPLGGVEPPLSSELANAIRDKFISMGALDRLNSAIVTAALGAIASERELFLREN 
AVNYMYDVEFAERDAATTDTGNVVYLSTKMDEDEDDIIKRSEILDKVSKRPAKEGIDWRP 
TPDNSFPYQLIWGDDSVDDTVLIDLITNAIVPNIFMAKFILFICNHLRAVIRSMREILYG 
NISSSSDNYFEDGRKWCFWLNLYNRLEWFMLVVRFVIFLHSKKESFSGADNVNVKRLLVV 
VVESFPPVLLDTEWVKTNITSWPVINNSNNNSTLPVTEDTLMRLAIRTSSGARHPIFDEI 
NSLTTAVTNRITFQSAEFCTKILLGRALDEEEAGTKMLVKSVKETGEEKDKNNTFSSFGL
LLKNTKNEELEINIGDNDDETTDVACWARTSSTSFIRNRTYAFKKIWGLEDASDVVELKR
ESDAITSFVTDKSSPLLFPYVSDWSCLLLHPCCKAPAIIKSVWLQILKDFSQENIKTINE 
KVQSLSSEICQKSNDRFKNKKIAAEHVRSVKKLLNTISNREQEAALSTEHCIWLTILWKQ 
VVQNTLNLLENFPV 
(SEQ ID NO: 281) 

Top Blast Hits

                                                             Score    E
Sequences producing significant alignments:                  (bits)  Value

O25761 (O25761) HYPOTHETICAL 88.8 KD PROTEIN                    38   0.29
O75130 (O75130) KIAA0635 PROTEIN                                34   3.3
Q17425 (Q17425) HYPOTHETICAL 62.4 KD PROTEIN B0024.10 IN CH     34   3.3
O33600 (O33600) PURINE NTPASE                                   34   4.4
Y066_NPVAC (P41467) HYPOTHETICAL 94.0 KD PROTEIN IN POL-LEF     34   4.4
Q53626 (Q53626) ATP-BINDING PROTEIN INVOLVED IN MITHRAMYCIN     33   5.7

Comments: 
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CT1033 

Nucleotide 

Genomic coordinates: 

Start: 121099 

Stop: 123634 (SEQ ID NO: 282) 



Amino Acid 

MFTHLTRAFRKMNNLVNRSFIDVHRVVAELSYPEFEEDVKNPESSIYRTPISLFQNKDIV 
TIVGDYILSPKTDSFQVLYPIKKVIEHFPVIFHCTHNNAPLWVHLLDERHHRLLQSLLTY
EIVNAKYRGIWIPYYRRPINYQTGKSLLMSKLASVKVLDILMRCGSYKFISLMCMINKK 
NNTNFLHCCASKWGEVGSKMMLHIAEMFFANPTTSQHLSDASSFPDAAAEDDKGKTPAHL 
AIQEDNADALLFLISLYGAPWFQDNNSYMKSALELKSNKCVKVLSFAADKYEILPNINNN 
QLEPDTMCGVCATSVEEDENEGKTTSLSWYQMNCKHYIHCECLMGMCAAAGNVQCPMCRE 
DVGDEVLERCPPTIFRWLKLAERSEHNRVLFEAKKQEFYKQMEAMKPPRVVVPPRRTFLT
PARRGERAIRIAREIATNAIAEATAQGDVNSYFPVLIDGSGEEYEEEGEEFFNSEEEALA
FGRPFLEDEEEARQIQMRQFAELSRRGVSVNIINNDNPHRHISTVNIVQPVYGVEKSPAA
SFIYNMLKNDVFESIRSRDTRVGGERVPVMNLSNDKRALFHAASSMLCDFATETNSQIVG 
LDFQAVYDPHHISNYIETFGSPLHAYPGAVTFLDGAQDYYAESIRYDNDIVSFSEMASEL 
HITEALDVFEGSLLSPLFKKIRTGKSYSNWNDHLRRRNYARDIAEEFVRVCENSLASREH
PPVHVHPFRDGAIPILIEYIVDFIHHCITWSMQVNALHCMRKYIEHENTNVHLLNLRPTD 
ERVEVLRVSQLRWSRLFNEQYNTRMSLSTKRLSLMKIFNHDLGVSKFGVYKLLDIIEMYC
FTLI 

(SEQ ID NO: 283) 
Top Blast Hits

                                                             Score    E
Sequences producing significant alignments:                  (bits)  Value

Q9XUM8 (Q9XUM8) W02A11.3 PROTEIN                                39   0.17
YEYC_CAEEL (Q21802) HYPOTHETICAL 64.6 KD PROTEIN R07B7.12 I     38   0.22
CAB40955 (CAB40955) HYPOTHETICAL 22.5 KD PROTEIN                37   0.50
YBR2_YEAST (P38239) HYPOTHETICAL 13.2 KD PROTEIN IN ORC2-TI     37   0.50
Q47955 (Q47955) HHDA PRECURSOR                                  37   0.50
O82239 (O82239) F17A22.9 PROTEIN                                37   0.50



Comments:

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 208 of CT1033: this
corresponds to nucleotides 123449 to 123656 of the genomic reference sequence.

TaqMan Primer/Probe Sets: 

5'start=1206 

5'stop=1225 

3'start=1269

3'stop=1287

5'primer=GGAAGCAATGAAACCTCCCA (residues 1206 to 1225 of SEQ ID NO: 282)
Tm5=58.56

3'primer=GATGGCTCGTTCGCCTCTT (residues 1269 to 1287 of SEQ ID NO: 282)
Tm3=58.97

probe1=TTGTTCCTCCTCGCAGGA (residues 1232 to 1249 of SEQ ID NO: 282)

probe1start=1232

probe1Stop=1249

direction1=Forward

Tm1=68.99

score1=1.99

length=82
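The `length` field in each primer/probe set appears to be the amplicon length, that is, the inclusive distance from `5'start` to `3'stop`. This is an inference from the numbers, not something the listing states. The sketch below checks it for the CT1033 record and also checks that each oligonucleotide is as long as its residue range implies. `inclusive_length` is an illustrative helper name.

```python
def inclusive_length(start: int, stop: int) -> int:
    """Length of a 1-based inclusive residue range."""
    return stop - start + 1


# Coordinates and sequences as listed for CT1033.
oligos = {
    "5'primer": ("GGAAGCAATGAAACCTCCCA", 1206, 1225),
    "3'primer": ("GATGGCTCGTTCGCCTCTT", 1269, 1287),
    "probe1": ("TTGTTCCTCCTCGCAGGA", 1232, 1249),
}
listed_length = 82

# Amplicon runs from the first base of the 5' primer to the last base of the 3' primer.
amplicon = inclusive_length(oligos["5'primer"][1], oligos["3'primer"][2])
print("amplicon:", amplicon, "listed:", listed_length)

for name, (seq, start, stop) in oligos.items():
    # Each sequence should have exactly as many bases as its range covers.
    print(name, len(seq), inclusive_length(start, stop))
```

The same relation holds for every primer/probe set in this part of the listing, which makes it a convenient check on digits that may have been damaged in transcription.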
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CT1034 
Nucleotide 
Genomic coordinates:
Start: 123757 

Stop: 126556 (SEQ ID NO: 284) 
Amino Acid 

MAADLLELAIQETIQSELEEIADTEFLNYLPHKTGICEEAAANGRPYLPTLEMRNEVDHF 
WSQDNRKLKLLGHFCGNLYVEAFIAGSIDAETCVGFLRSQATGLGYPLLKKLALIAREDK 
SNTTNYNLYIDRNSMMKQVFSAEIDKRPSSIQNTSHTKSSPVYLKLIDRRTECLALDWLD 
ASKRTAKEIGAARKVCFLQNLIVAILIPAYTETFVLDTGNELEQQVLDDAYFNAENKDKV 
DEMCVVAILSTLHNLFVRKSLPHHLYNAPFRLPPFGQHPIINIENSSFFNEDTTPILASI 
SIPSSMVIKHHTRKNSRWRCPNNLMTAAERSIFLRGVLTVSGDYGWFSVIVGSTIMPSVL 
FYGDRKHLINTVKSNNFSAITCSYWNKYMDCRSYGFEIIDTPENNCGFRIRAAIDCSNTD
FHSPVTRVNKKKTSIINAVKNPFFIRHTEPKWYNKNAMCGEVLENVGVTLEQHVRVSDEY
MDRFGSLIiLGREKKWTCNYLDRIKSLETISNNLKGKIDTMCKILETKYNYKSSSLYYKQI 
TATSDDPIKMKIIASINKRRYLCNILEFAIISSEKKDEVEEDHTKTGNGGCAFSKYKKKQ 
LEPKQHLIVKVNKYIEAFSLIKMLRNDCERNKCRFKEAEIRECANELVRELYRASARSYV 
HDLVLKRTNVHLTWQRPYDENANTIMSLIPKCKLHTVLYDKDSRDVKLLNFLRTRDGNYN 
PIRHSMLELVYGEEYAKDVSTVTCFEWLKWCSKKGVIKYEDFLDRYEKTGEEDKDEREFF 
RLKKCSRDHTKDIKKIENVLNSDTLYSYSLDKNVQTHASSSTVVKNDTDGKTSMVGWDYI 
FSIGKGEKTTKKRKLETIDISSSDDDDEEEEEEDEGKRMKMNNCSSSIKNKSKNKNGRMC 
CTDILNVVEPSLPNTLSFNCVKSMDVLNLL 
(SEQ ID NO: 285) 

Top Blast Hits

                                                             Score    E
Sequences producing significant alignments:                  (bits)  Value

O62235 (O62235) F36F2.3 PROTEIN                                 46   8e-04
AAF04442 (AAF04442) PUTATIVE TRANSLATION INITIATION FACTOR      43   0.012
O60313 (O60313) KIAA0567 PROTEIN (FRAGMENT)                     41   0.037
Q90491 (Q90491) DNA BINDING PROTEIN E12                         41   0.048
BAA82145 (BAA82145) MYOSIN HEAVY CHAIN 2B                       40   0.063
O97303 (O97303) PFC1060C PROTEIN                                40   0.063

Comments:

EST confirmation of the predicted transcript:
An isolated EST has sequence identity to nucleotides 1 to 647 of CT1034: this
corresponds to nucleotides 125649 to 126295 of the genomic reference sequence.

TaqMan Primer/Probe Sets:

5'start=1391

5'stop=1410

3'start=1468

3'stop=1489

5'primer=AAAATGTTGGCGTGACCCTC (residues 1391 to 1410 of SEQ ID NO: 284)
Tm5=58.49

3'primer=ACGTCCATTTCTTTTCTCGTCC (residues 1468 to 1489 of SEQ ID NO: 284)
Tm3=58.24

probe1=AACAACACGTCCGTGTTA (residues 1412 to 1429 of SEQ ID NO: 284)

probe1start=1412

probe1Stop=1429

direction1=Reverse

Tm1=69.00

score1=1.99

length=99
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SBV.ORF022.txt
Nucleotide 

Genomic Coordinates: 
Start: 265664 

Stop: 266297 (SEQ ID NO:286) 

Comments: EST confirmation of transcript not predicted computationally. 

TaqMan Primer/Probe Sets: 

5'start=230

5'stop=252

3'start=321

3'stop=343

5'primer=CAAATCAAAGGCCATATGAAACC (residues 230 to 252 of SEQ ID NO: 286)
Tm5=58.41

3'primer=GGTGGTCTTTATTTTCATCACCG (residues 321 to 343 of SEQ ID NO: 286)
Tm3=58.41
primerScore=0.86
allele1=

probe1=GAGGATAGCCAGTTCTAT (residues 265 to 282 of SEQ ID NO: 286)

probe1start=265

probe1Stop=282

direction1=Reverse

Tm1=68.99

score1=1.99

length=114 
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C003 

Nucleotide 

Genomic Coordinates: 

Start: 126709 

Stop: 127126 (SEQ ID NO: 287) 

Comments: EST confirmation of transcript not predicted computationally. 

TaqMan Primer/Probe Sets:

5'start=175

5'stop=194

3'start=265

3'stop=283

5'primer=CATGGTGTGCCTAAAGCAGG (residues 175 to 194 of SEQ ID NO: 287)
Tm5=57.84

3'primer=TGGCACAATCATGGGTTGA (residues 265 to 283 of SEQ ID NO: 287)
Tm3=57.81
primerScore=0.91
allele1=

probe1=AATGGAGGCATCGAAGAA (residues 212 to 229 of SEQ ID NO: 287)

probe1start=212

probe1Stop=229

direction1=Reverse

Tm1=69.02

score1=1.97

length=109 
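Each primer/probe set is a flat list of `key=value` lines, some followed by a parenthetical residue range. A minimal sketch of reading such a record into a dictionary is shown below, using the C003 values with spacing normalised. The regular expression and the name `parse_record` are illustrative assumptions, not part of any tool named in this document.

```python
import re

# C003 primer/probe record, with transcription spacing normalised.
RECORD = """\
5'start=175
5'stop=194
3'start=265
3'stop=283
5'primer=CATGGTGTGCCTAAAGCAGG (residues 175 to 194 of SEQ ID NO: 287)
Tm5=57.84
3'primer=TGGCACAATCATGGGTTGA (residues 265 to 283 of SEQ ID NO: 287)
Tm3=57.81
primerScore=0.91
allele1=
probe1=AATGGAGGCATCGAAGAA (residues 212 to 229 of SEQ ID NO: 287)
probe1start=212
probe1Stop=229
direction1=Reverse
Tm1=69.02
score1=1.97
length=109
"""

# key, "=", value up to an optional trailing "(...)" annotation.
LINE = re.compile(r"^(?P<key>[^=\s]+)=(?P<value>[^(]*?)\s*(?:\(.*\))?$")


def parse_record(text: str) -> dict[str, str]:
    """Return the key/value pairs of one primer/probe record as strings."""
    fields: dict[str, str] = {}
    for line in text.splitlines():
        match = LINE.match(line.strip())
        if match:
            fields[match.group("key")] = match.group("value")
    return fields


fields = parse_record(RECORD)
print(fields["5'primer"], fields["Tm5"], fields["direction1"])
```

Values are kept as strings so that empty fields such as `allele1=` survive unchanged. Callers can convert coordinates with `int()` and melting temperatures with `float()` as needed.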
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C010 

Nucleotide 

Genomic Coordinates: 

Start: 184094 

Stop: 184931 (SEQ ID NO: 288) 

Comments: EST confirmation of transcript not predicted computationally. 

TaqMan Primer/Probe Sets: 

5'start=547 

5'stop=570

3'start=655

3'stop=678

5'primer=AAGATGATGAAACCTGGTTCAAAA (residues 547 to 570 of SEQ ID NO: 288)
Tm5=57.75

3'primer=CGTCATGGAAGAAAAGGAATTGTA (residues 655 to 678 of SEQ ID NO: 288)

Tm3=58.24

primerScore=0.67

allele1=

probe1=CTCCTTGCAAGTGCGATA (residues 598 to 615 of SEQ ID NO: 288)

probe1start=598

probe1Stop=615

direction1=Forward

Tm1=69.01

score1=1.98

length=132 
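In every primer/probe set listed here the probe coordinates fall strictly between the end of the 5' primer and the start of the 3' primer. The sketch below encodes that layout as a check, using the C010 coordinates. The class `AssayCoords` and its method are illustrative names, and the rule is inferred from the listed numbers rather than stated in the document.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AssayCoords:
    """1-based inclusive coordinates of one primer/probe set."""
    five_start: int
    five_stop: int
    probe_start: int
    probe_stop: int
    three_start: int
    three_stop: int

    def probe_between_primers(self) -> bool:
        """True if the probe lies wholly between the two primers."""
        return self.five_stop < self.probe_start <= self.probe_stop < self.three_start

    def amplicon_length(self) -> int:
        """Inclusive distance from 5' primer start to 3' primer stop."""
        return self.three_stop - self.five_start + 1


# Coordinates as listed for C010.
c010 = AssayCoords(547, 570, 598, 615, 655, 678)
print(c010.probe_between_primers(), c010.amplicon_length())

# A deliberately broken record (probe overlapping the 5' primer) fails the check.
broken = AssayCoords(547, 570, 560, 577, 655, 678)
print(broken.probe_between_primers())
```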
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C020 

Nucleotide 

Genomic Coordinates:
Start: 60122 

Stop: 60341 (SEQ ID NO: 289) 

Comments: EST confirmation of transcript not predicted computationally. 

TaqMan Primer/Probe Sets: 

5'start=2

5'stop=26

3'start=118

3'stop=140

5'primer=CAACCTCCACATCTCCTTCTAATTC (residues 2 to 26 of SEQ ID NO: 289)
Tm5=58.27

3'primer=AACGCTTTGTTTAACGTGCTTTT (residues 118 to 140 of SEQ ID NO: 289)

Tm3=58.18

primerScore=0.61

allele1=

probe1=ATTCAAGAGGTCACAAACA (residues 80 to 98 of SEQ ID NO: 289)

probe1start=80

probe1Stop=98

direction1=Forward

Tm1=68.75

score1=1.50

length=139 
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C028 

Nucleotide 

Genomic Coordinates:

Start: 277009 

Stop: 277324 (SEQ ID NO: 290) 

Comments: EST confirmation of transcript not predicted computationally. 

TaqMan Primer/Probe Sets: 

5'start=293 

5'stop=315

3'start=384

3'stop=406

5'primer=GGTGCAGTTTCATCCTTACCATT (residues 293 to 315 of SEQ ID NO: 290)
Tm5=58.07

3'primer=TCGTCATCATAGTCGTCGTCAAC (residues 384 to 406 of SEQ ID NO: 290)

Tm3=58.72

primerScore=0.86

allele1=

probe1=TTCCACAACCACCACTAC (residues 320 to 337 of SEQ ID NO: 290)

probe1start=320

probe1Stop=337

direction1=Forward

Tm1=68.94

score1=1.94

length=114 
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C036 

Nucleotide 

Genomic Coordinates: 

Start: 115067 (SEQ ID NO: 291)

Stop: 115380 

Comments: EST confirmation of transcript not predicted computationally. 

TaqMan Primer/Probe Sets: 

5'start=99 

5'stop=117

3'start=158

3'stop=177

5'primer=CCACTGCTGGACGCATCTC (residues 99 to 117 of SEQ ID NO: 291)
Tm5=58.94

3'primer=CGACGGACAGTGGAGCTCTT (residues 158 to 177 of SEQ ID NO: 291)

Tm3=59.04

primerScore=1.25

allele1=

probe1=CCATCAATGAAGAAGCGT (residues 131 to 148 of SEQ ID NO: 291)

probe1start=131

probe1Stop=148

direction1=Forward

Tm1=68.97

score1=1.97

length=79 
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C037 

Nucleotide 

Genomic Coordinates: 

Start: 186828 

Stop: 187047 (SEQ ID NO: 292) 

Comments: EST confirmation of transcript not predicted computationally. 

TaqMan Primer/Probe Sets: 

5'start=63 

5'stop=82 

3'start=133 

3'stop=157

5'primer=TTGACATCAGACCGACCCAG (residues 63 to 82 of SEQ ID NO: 292)
Tm5=58.08

3'primer=GGAATTTTACTCTTCCTCCATCTCA (residues 133 to 157 of SEQ ID NO: 292)

Tm3=57.92

primerScore=0.62

allele1=

probe1=CACCCTCTAAACTCGAGC (residues 88 to 105 of SEQ ID NO: 292)

probe1start=88

probe1Stop=105

direction1=Forward

Tm1=69.49

score1=1.50

length=95 
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C019

Nucleotide 

Genomic Coordinates:

Start: 141541

Stop: 141647 (SEQ ID NO: 293) 

Comments: EST confirmation of transcript not predicted computationally. 
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